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Preface 



The 11th International Symposium on Graph Drawing (GD 2003) was held on 
September 21-24, 2003, at the Universita degli Studi di Perugia, Perugia, Italy. 
GD 2003 attracted 93 participants from academic and industrial institutions in 
17 countries. 

In response to the call for papers, the program committee received 88 regu- 
lar submissions describing original research and/or system demonstrations. Each 
submission was reviewed by at least 4 program committee members and com- 
ments were returned to the authors. Following extensive e-mail discussions, the 
program committee accepted 34 long papers (12 pages each in the proceedings) 
and 11 short papers (6 pages each in the proceedings). Also, 6 posters (2 pages 
each in the proceedings) were displayed in the conference poster gallery. 

In addition to the 88 submissions, the program committee also received a 
submission of special type, one that was not competing with the others for a 
time slot in the conference program and that collects selected open problems in 
graph drawing. The aim of this paper, which was refereed with particular care 
and UNCHANGED two rounds of revisions, is to stimulate future research in the 
graph drawing community. The paper presents 42 challenging open problems in 
different areas of graph drawing and contains more than 120 references. Although 
the length of the paper makes it closer to a journal version than to a conference 
extended abstract, we decided to include it in the conference proceedings so 
that it could easily reach in a short time the vast majority of the graph drawing 
community. 

GD 2003 invited two distinguished lecturers. Pat Hanrahan, from Stanford 
University, gave a talk about the connection between semantic constraints and 
aesthetics in graph drawing and information visualization. Giuseppe Italiano, 
from the Universita di Roma Tor Vergata, gave a talk on algorithm engineering 
and experimental analysis of graph algorithms. 

As usual, the annual graph drawing contest was held during the conference. 
A report about the contest is included in the proceedings. 

Many people in the graph drawing community contributed to the success of 
GD 2003. In particular, the authors of submitted papers, demos, and posters are 
due special thanks, as are the members of the program committee and the exter- 
nal reviewers. Many thanks to the organizing committee members Carla Binucci, 
Emilio Di Giacomo, Luca Grilli, Maurizio Patrignani, and Maurizio Pizzonia for 
their support. My very special thanks go to the local arrangements chair Walter 
Didimo, for his invaluable help. Without his support of the organization and his 
many comments and suggestions, the conference would have been impossible to 
organize. 

Thanks are due to the industrial sponsors of the conference: the “gold” spon- 
sors, Tom Sawyer Software; the “silver” sponsors, Mitsubishi Electrics, Oreas, 
Digilab 2000, and Integra Sistemi s.r.l.; and the “contributor,” Kelyan SMC. 



VI 



Finally, many thanks go to the Dipartimento di Informatica e Automazione of 
the Universita degli Studi di Roma Tre and to the Dipartimento di Ingegneria 
Elettronica e dell’Informazione of the Universita degli Studi di Perugia for their 
help and financial support. The conference was also supported in part by “Pro- 
getto ALINWEB: Algoritmica per Internet e per il Web,” MIUR Programmi di 
Ricerca Scientifica di Rilevante Interesse Nazionale. 

The 12th International Symposium on Graph Drawing GD 2004 will be held 
in New York City, September 29-October 2, 2004, with Janos Pach as the con- 
ference chair. 
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Confluent Drawings: Visualizing Non-planar 
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Abstract. We introduce a new approach for drawing diagrams. Our 
approach is to use a technique we call confluent drawing for visualizing 
non-planar graphs in a planar way. This approach allows us to draw, in 
a crossing- free manner, graphs — such as software interaction diagrams — 
that would normally have many crossings. The main idea of this approach 
is quite simple: we allow groups of edges to be merged together and drawn 
as “tracks” (similar to train tracks). Producing such confluent diagrams 
automatically from a graph with many crossings is quite challenging, 
however, so we offer two heuristic algorithms to test if a non-planar 
graph can be drawn efficiently in a confluent way. In addition, we identify 
several large classes of graphs that can be completely categorized as being 
either confluently drawable or confluently non-drawable. 



1 Introduction 

In most graph visualization applications, graphs are often drawn in a standard 
way: the vertices of a graph are drawn as simple shapes, such as circles or boxes, 
and the edges are drawn as individual curves connecting pairs of these shapes 
(e.g., see [12,13,22]). 

Related Prior Work. There are several aesthetic criteria that have been ex- 
plored algorithmically in the area of graph drawing (e.g., see [12,13,22]). Exam- 
ples of aesthetic goals designed to facilitate readability include minimizing edge 
crossings, minimizing a drawing’s area, minimizing bends, and achieving good 
separation of vertices, edges, and angles. Of all of these criteria, however, the 
arguably most important is to minimize edge crossings, since crossing edges tend 
to confuse the eye when one is viewing adjacency relationships. Indeed, an exper- 
imental analysis by Purchase [31] suggests that edge-crossing minimization [19, 
20,25] is the most important aesthetic criteria for visualizing graphs. Ideally, we 
would like drawings that have no edge crossings at all. 

* This is an extended abstract. The full version of this paper can be found at 
http://arxiv.org/abs/cs.CG/0212046. Work by the second author is supported 
by NSF grant CCR-9912338. Work by the third and the forth author is supported 
by NSF Grants CCR-0098068, CCR-0225642, and DUE-0231467. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 1-12, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 
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Graphs that can be drawn in the standard way in the plane without edge 
crossings are called planar graphs [28], and there are a number of existing ef- 
ficient algorithms for producing crossing-free drawings of planar graphs (e.g., 
see [8,9,11,34,6,21,36] ). Unfortunately, most graphs are not planar; hence, most 
graphs cannot be drawn in the standard way without edge crossings, and such 
non-planar graphs seem to be common in many applications. There are some 
heuristic algorithms for minimizing edge crossings of non-planar graphs (e.g., 
see [19,26,20,25]), but the general problem of drawing a non-planar graph in a 
standard way that minimizes edge-crossings is NP-hard [16]. Thus, we cannot 
expect an efficient algorithm for drawing non-planar graphs so as to minimize 
edge crossings. 

The technique of replacing complete bipartite subgraphs (bicliques) with star- 
like structures is used as Edge Concentration in [27] and Factoring in [5], both 
to reduced the number of edges in the original graphs. This technique has the 
desired side effect of reducing the number of crossings, however, its primary 
goal is to minimize the total number of edges, not to minimize the number of 
crossings. Furthermore, the time complexity of approximation algorithm given 
in [27] is not desirable. Recently Lin [24] proves that the optimization problem of 
edge concentration is NP-hard. A similar idea is used in [15] for weighted graph 
compressions, where cliques and bicliques are replaced with stars. It is shown 
that the general unit weight problem is essentially as hard to approximate as 
graph coloring and maximum clique. Again, [15] doesn’t directly address the 
minimization of the number of crossings. 

Our Results. Given the difficulty of edge-crossing minimization and the ubiq- 
uity of non-planar graphs, we explore in this paper a diagram visualization 
approach, called confluent drawing, that attempts to achieve the best of both 
worlds — it draws non-planar graphs in a planar way. Moreover, we provide two 
heuristic algorithms for producing confluent drawings for directed and undirected 
graphs, respectively, focusing on graphs with bounded arboricity. 

The main idea of the confluent drawing approach for visualizing non-planar 
graphs in a planar way is quite simple — we merge edges into “tracks” so as to 
turn edge crossings into overlapping paths. (See Fig. 1.) 




Fig. 1 . An example of confluent drawing of an object-interaction diagram. We show 
a standard drawing in (a) and a confluent drawing in (b). 



Confluent Drawings: Visualizing Non-planar Diagrams 



3 



The resulting graphs are easy to read and comprehend, while also encapsulat- 
ing a high degree of connectivity information. Although we are not familiar with 
any prior work on the automatic display of graphs using this confluent diagram 
approach, we have observed that some airlines use hand-crafted confluent dia- 
grams to display their route maps. Diagrams similar to our confluent drawings 
have also been used by Penner and Harer [29] to study the topology of surfaces. 

In addition to providing heuristic algorithms for recognizing and drawing 
confluent diagrams, we also show that there are large classes of non-planar graphs 
that can be drawn in a planar way using our confluent diagram approach. 

2 Confluent Drawings 

It is well-known that every non-planar graph contains a subgraph homeomorphic 
to the complete graph on five vertices, K 5 , or the complete bipartite graph be- 
tween two sets of three vertices, K 3}3 (e.g., see [3]). On the other hand, confluent 
drawings, with their ability to merge crossing edges into single tracks, can easily 
draw any K nm or K n in a planar way. Fig. 2 shows confluent drawings of K 3}3 
and K 3 . 



A curve is locally-monotone if it contains no self intersections and no sharp 
turns, that is, it contains no point with left and right tangents that form an 
angle less than or equal to 90 degrees. Intuitively, a locally-monotone curve is 
like a single train track, which can make no sharp turns. Confluent drawings are 
a way to draw graphs in a planar manner by merging edges together into tracks, 
which are the unions of locally-monotone curves. 

An undirected graph G is confluent if and only if there exists a drawing A 
such that: 

— There is a one-to-one mapping between the vertices in G and A, so that, for 
each vertex v £ V(G), there is a corresponding vertex v' £ A, which has a 
unique point placement in the plane. 

— There is an edge ( Vi,Vj ) in E(G) if and only if there is a locally-monotone 
curve e' connecting v[ and v'j in A. 

— A is planar. That is, while locally-monotone curves in A can share overlap- 
ping portions, no two can cross. 




Fig. 2. Confluent drawings of K 3t3 and K 5. 
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Our definition does not allow for confluent graphs to contain self loops or 
parallel edges, although we do allow for tracks to contain cycles and even multiple 
ways of realizing the same edge. Moreover, our definition implies that tracks in 
a confluent drawing have a “diode” property that does not allow one to double- 
back or make sharp turns after one has started going along a track in a certain 
direction. 

Directed confluent drawings are defined similarly, except that in such draw- 
ings the locally-monotone curves are directed and the tracks formed by unions 
of curves must be oriented consistently. 

3 Heuristic Algorithms 

Though the planarity of a graph can be tested in linear time, it appears difficult 
to quickly determine whether or not a graph can be drawn confluently. If a graph 
G contains a non-planar subgraph, then G itself is non-planar too. But similar 
closure properties are not true for confluent graphs. Adding vertices and edges 
to a non-confluent graph increases the chances of edges crossing each other, 
but it also increases the chances of edges merging. Currently, the best method 
we know of for determining conclusively in the worst case whether a graph is 
confluent or not is a brute force one of exhaustively listing all possible ways of 
edge merging and checking the merged graphs for planarity. Therefore, it is of 
interest to develop heuristics that can find confluent drawings in many cases. 

Fig. 3 shows confluent drawings using a “traffic circle” structure for complete 
subgraphs (cliques) and complete bipartite subgraphs (bicliques). At a high level, 
our heuristic drawing algorithm iteratively finds clique subgraphs and biclique 
subgraphs and replaces them with traffic-circle subdrawings. 





Fig. 3. Confluent drawings of K 5 and K 3,3 using “traffic circle” structures. 



Chiba and Nishizeki [7] discuss the problem of listing complete subgraphs 
for graphs of bounded arboricity. The arboricity a(G) is the minimum number 
of forests into which the edges of G can be partitioned. The listing algorithm is 
applicable for such graphs. Chiba and Nishizeki show that there can be at most 
0(n) cliques of a given size in such graphs and give a linear time algorithm for 
listing these clique subgraphs. Eppstein [14] gives a linear time algorithm for 
listing maximal complete bipartite subgraphs in graphs of bounded arboricity. 
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The total complexity of all such graphs is O(n), and again they can be listed in 
linear time. 

In our heuristic algorithm for undirected graphs, we will use the clique sub- 
graphs listing and the biclique subgraphs listing algorithms as our subroutines. 

HeuristicDrawUndirected(G) 

Input. A undirected sparse graph G. 

Output. Confluent drawing of G if succeed, fail otherwise. 

1. If G is planar 

2. draw G 

3. else if G contains a large clique or biclique subgraph C 

4. create a new vertex v 

5. obtain a new graph G' by removing edges of C and 
connecting each vertex of C to v 

6. HeuristicDrawUndirected(G') 

7. replace v by a small “traffic circle” to get a confluent 
drawing of G 

8. else fail 

Fig. 4. 



In step 3 of the algorithm in Fig. 4, the cliques are given higher priority over 
bicliques, otherwise a clique would be partially covered by a biclique. Cliques of 
three or fewer vertices, and bicliques with one side consisting of only one vertex, 
are not replaced because the replacement cannot change the planarity of the 
graph. We now discuss the time performance of this heuristic. 

Theorem 1. In graphs of bounded arboricity, algorithm HeuristicDrawUn- 
directed can be made to run in time 0(n), assuming hash tables with constant 
time per operation. 

Proof. We store a bit per edge of the original graph so we can quickly look up 
whether it is still part of our replacement. We begin the heuristic by looking for 
cliques, since we want to give them priority over bicliques. List all the complete 
subgraphs in the graph with four or more vertices, and sort them by size (the size 
of the complete subgraph is bounded too in graphs with bounded arboricity). 
Then, for each complete subgraph X in sorted order, we check whether X is 
still a clique of the modified graph, and if so perform a replacement of X. It is 
not hard to see that the new vertex v of the replacement cannot belong to any 
clique, so this algorithm correctly finds a maximal sequence of cliques to replace. 

Next, we need to similarly dynamize the search for bicliques. This is more dif- 
ficult, because a biclique may have nonconstant size and because the replacement 
vertex v may belong to additional bicliques. We perform this step by dynamizing 
the algorithm of Eppstein [14] for listing all bicliques. This algorithm uses the 
idea of a d-bounded acyclic orientation: that is, an orientation of the edges of the 
graph, such that the oriented graph is acyclic and the vertices have maximum 
outdegree d. For graphs of arboricity a, a (2 a — l)-bounded acyclic orientation 
may easily be found in linear time. For such an orientation, define a tuple to be 
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a subset of the outgoing neighbors of any vertex, and let v be a tuple creator of 
tuple T if all vertices of T are outgoing neighbors of v. For graphs of bounded 
arboricity, there are at most linearly many distinct tuples. For each maximal 
biclique, one of the two sides of the bipartition must be a tuple, T [14]. The 
other side consists of two types of vertices: tuple creators of T, and outgoing 
neighbors of vertices of T. 

Our algorithm stores a hash table indexed by the set of all tuples in the mod- 
ified graph. The hash table entry for tuple T stores the number of tuple creators 
of T, and a list of outgoing neighbors of vertices of T that are adjacent to all 
tuple members. For each edge (u, v ) in the graph, oriented from u to v, we store 
a list of the tuples T containing v for which u is listed as an outgoing neighbor. 
We also store a priority queue of the maximal bicliques generated by each tuple, 
prioritized by size; it will suffice for our purposes if the time to find the largest 
biclique is proportional to the biclique size, and it is easy to implement a pri- 
ority queue with such a time bound. With these structures, we may easily look 
up each successive biclique replacement to perform in algorithm Heuristic- 
DrawUndirected. Each replacement takes time proportional to the number 
of edges removed from the graph, so the total time for performing replacements 
is linear. 

It remains to show how to update these data structures when we perform a 
biclique replacement. To update the acyclic orientation, orient each edge from C 
to v, except for those edges from vertices of C that have no outgoing edges in C. 
It can be seen that this orientation preserves d-boundedness and acyclicity. When 
a new vertex v is created by a replacement, create the appropriate hash table 
entries for tuples containing u; the number of tuples created by a replacement 
is proportional to the number of edges removed in the same replacement, so 
the total number of tuples created over the course of the algorithm is linear. 
Whenever a replacement causes edges from a vertex x to change, update the 
hash entries for all tuples for which a: is a creator; this step takes 0(1) time 
per change. Also, update the hash entries for all tuples to which x belongs, to 
remove vertices that are no longer outgoing neighbors of x ; this step takes time 
0(1) per changed tuple, and each tuple changes 0(1) times over the course of 
the algorithm. Whenever a change removes incoming edges of x, we must remove 
the other endpoints of those edges from the lists of outgoing neighbors of tuples 
to which x belongs; using the lists associated with each incoming edge, this takes 
constant time per removal. Therefore, all steps can be performed in linear total 
time. □ 

An example of the input for algorithm HeuristicDrawUndirected and 
the output drawing produced by this heuristic is shown in Fig. 5. 



4 Some Confluent Graphs 

The heuristic algorithms presented in the previous section are most applicable 
to sparse graphs, because sparseness is needed for the linear time bound of the 
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Fig. 5. An example of running the undirected heuristic algorithm. The input graph is 
shown in (a) and the output drawing is shown in (b). 



maximal bipartite subgraph listing subroutine. However, there are also several 
denser classes of graphs that we can show to be confluent. 

Interval graphs. An interval graph is formed by a set of closed intervals S = 
{[ai, 61], [02,62]) • • [a„, 6„]}. The interval graph is defined to have the intervals 

in S' as its vertices and two vertices [a*, 6,;] and [aj, bj] are connected by an edge 
if and only if these two intervals have a non-empty intersection. Such graphs are 
typically non-planar, but we can draw them in a planar way using a confluent 
drawing 1 . 

Theorem 2. Every interval graph is confluent. 

Proof. The proof is by construction. We number the interval endpoints by rank, 
X = { 0 , 1 , . . . , n — 1 }, and place these endpoints along the z-axis. We then build 
a two-dimensional lattice on top of these points in a fashion similar to Pascal’s 
triangle, using a connector similar to an upside-down “V”. These connectors 
stack on top of one another so that the apex of each is associated with a unique 
interval on X. We place each point from our set S of intervals just under its 
corresponding apex and connect it into the (single) track so that it can reach 
everything directly dominated by this apex in the lattice. At the bottom level, 
we connect the updside-down V’s with rounded connectors. By this construction, 
we create a single track that allows each pair of vertices connected in the interval 
graph to have a locally-monotone path connecting them. (See Fig. 6.) □ 



Complements of trees. The complements of trees (graphs formed by con- 
necting all pairs of vertices that are not connected in some tree) are also called 
cotrees. In general, cotrees are highly non-planar and dense, since a cotree with 
n vertices has n(n — l )/2 — n + 1 edges. Nevertheless, we have the following 
interesting fact. 

Theorem 3. The complement of a tree is confluent. 

1 A similar construction works for circular-arc graphs and is left as an exercise for the 
interested reader. 
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Fig. 6. Illustrating a confluent way to draw a non-planar interval graph: (a) an interval 
graph and its defining intervals; (b) its corresponding confluent drawing. 



Proof. We prove the claim by recursive construction, using a single track for the 
entire graph. Assign a bounding rectangle for the tree and a bounding rectangle 
for every subtree in that tree. Place the complement of the tree into the bounding 
rectangles such that nodes of every subtree is within its bounding rectangle and 
the bounding rectangles of subtrees are contained in their parent’s bounding 
rectangle. In addition, place a connector at the Northeastern corner of every 
bounding box. This connector is an imaginary point at which the single track for 
the entire graph will connect into this portion of the cotree. (See Fig. 7.) Connect 
the root node in each subtree to every connector of its children. Connect every 
node to the connector of its parent. Also connect every node to its siblings and 
the connectors of its siblings, as shown in the figure. The obtained drawing is 
the confluent drawing of the complement of the given tree. □ 





Fig. 7. Illustrating a confluent way to draw the complement of a tree: (a) a node and its 
children in the tree; (b) the corresponding portion of a track in the confluent drawing 
of the complement. 



Paths are very special cases of trees. Every vertex in a path has a degree of 2 
except its two endpoints, each of which has a degree of 1. The complement of a 
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path can be drawn using the cotree method in the above proof. We show a nice 
confluent drawing of the complement of a path in Fig. 8. 



A 7.17.1 



Fig. 8. P 8 and the confluent drawing of Pg. 



Complements of n-cycles. An n-cycle is a cycle with n vertices. 

Theorem 4. The complement of an n-cycle is confluent. 

Proof. First remove one vertex from the n-cycle and draw the confluent graph 
for the complement of the obtained path. Then add the vertex back and connect 
it with all vertices in the path except for its two neighbors. The obtained drawing 
is a confluent drawing (See Fig. 9 for an example.) □ 




r — ~c — 7 

Fig. 9. A confluent drawing of Cs. 



Cographs. A complement reducible graph (also called a cograph ) is defined 
recursively as follows [10]: 

— A graph on a single vertex is a cograph. 

— If Gi, G 2 , • • •, Gk are cographs, then so is their union Gi U G 2 U • • • U Gk- 

— If G is a cograph, then so is its complement G. 

Cographs can be obtained from single node graphs by performing a finite 
number of unions and complementations. 

Theorem 5. Cographs are confluent. 

Proof. If cographs A and B are confluent, we can show AU B and AU B are 
confluent too. First we draw A confluently inside a disk and attach a “tail” to 
the boundary of the disk. Connect the attachment point to each vertex in the 
disk. B is drawn in the same way. Then A U B is formed by joining the two 
“tails” together so that they don’t connect to each other. A U B is formed by 
joining the two “tails” of A and B together so that they connect to each other. 
(See Fig. 10.) By the definition of cographs and induction we know cographs are 
confluent. □ 
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Fig. 10. Confluent AVJ B and A U B. 



5 Some Non-confluent Graphs 

In this section, we show that some graphs cannot be drawn confluently. These 
graphs include the Petersen graph P, the graph P — v formed by removing one 
vertex from the Petersen graph, and the 4-dimensional hypercube. 

The Petersen graph. By removing one vertex and its incident edges from the 
Petersen graph we obtain a graph homeomorphic to P'33. It contains no P'2,2 
as a subgraph. Moreover, note that P’2,2 is the most basic structure that allows 
for edge merging into tracks. Thus the resulting graph is non-confluent. This 
graphis the smallest non-confluent graph we know of. 

The Petersen graph itself is also non-confluent, as adding the vertex and edges 
back to its non-confluent subgraph doesn’t create any four-cycles that could be 
used for confluent tracks. 

4-dimensional hypercube. The 4-dimensional hypercube in Fig. 11 is non- 
confluent. 
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Fig. 11. The 4-dimensional hypercube. 



The hypercube contains many subgraphs isomorphic to 3-dimensional cubes. 
Cubes are planar graphs, but in order to show non-confluence for the hypercube, 
we analyze more carefully the possible drawings of the cubes. Observe that, 
because there are no P'2,3 subgraphs in cubes or hypercubes, the only possible 
confluent tracks are P'2,2 ’s formed from the vertices of a single cube face. 

Proof. (We omit the proof in this extended abstract.) 

Acknowledgments. We would like to thank Jie Ren and Andre van der Hoek 
for supplying us with several examples of graphs used in software visualization. 
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Abstract. We present an extensive experimental study of heuristics for 
crossing minimization. The heuristics are based on the planarization ap- 
proach, so far the most successful framework for crossing minimization. 
We study the effects of various methods for computing a maximal pla- 
nar subgraph and for edge re-insertion including post-processing and 
randomization. 



1 Introduction 

The crossing minimization problem is one of the crucial problems in graph draw- 
ing (see, e.g., [22]). Here, the task is to find a drawing of a graph in the plane 
with the minimum number of crossings. The crossing number represents a fun- 
damental measure of non-planarity of graphs and has been studied for more 
than 40 years by graph theorists. The algorithmical problem of computing the 
crossing number has also been studied in the context of VLSI-layout. So far, only 
a few infinite classes of graphs exist, for which the crossing number is known. 
We do not even know the asymptotic value for the complete graph K n with n 
vertices and for the complete bipartite graph K n , n with 2 n vertices, as n tends 
to infinity [23]. 

We do know, however, that the crossing number problem and several of 
its variants are NP-hard [13,3]. While many other prominent NP-hard prob- 
lems have been successfully attacked with integer programming and branch-and- 
bound techniques, no similar approach to the crossing number problem is known 
to date. To our knowledge, no exact algorithm exists that is able to solve even 
small instances of the crossing number problem to provable optimality within 
reasonable computation time. For graphs with bounded degrees, Even et al. [8] 
have recently suggested an approximation algorithm in which the sum of the 
numbers of vertices and crossings is 0(log 3 |V|) times the minimum sum thus 
improving the results of 0( log 4 | V|) by Bhatt and Leighton [2] and Leighton and 
Rao [17]. Grohe [9] has given an exact algorithm that works in quadratic time if 
the crossing number is fixed. Both algorithms are rather of theoretical nature and 
have so far not been useful for solving practical instances. Much easier to tackle 
is the bipartite crossing number, for which a vast amount of experimental pa- 
pers exist in the literature (see, e.g., [16,7]). Experimental papers in other fields 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 13-24, 2004. 
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include, e.g., a collection of experimental studies in graph drawing by Vismara 
et al. [24] and the paper by Brandenburg et al. on force-directed methods [4]. 

The only experimental state-of-the-art study - to our knowledge - concerning 
crossings for general graphs has been published in [6]. It includes four different 
algorithms for orthogonal graph drawing on the benchmark set described above. 
Two of these algorithms were based on the topology-shape-metrics approach, 
and the others are incremental algorithms that focus on a small area and a small 
number of bends. The results show that the former two algorithms are superior 
in terms of the criteria number of crossings, number of bends, area, and edge 
length. E.g., the best algorithm had up to 8 times fewer crossings. Therefore, we 
decided to solve the crossing minimization problem heuristically using a 2-step 
planarization approach, which we will discuss in Section 3. There, we will also 
describe our strategies (some of them are new) for crossing reduction. Section 4 
contains a discussion of our extensive experimental results. Our paper ends with 
a section on conclusions (see Section 5) . Before we can start the technical part of 
the paper, we need to introduce some basic mathematical terms (see Section 2) . 

2 Preliminaries 

In a drawing of a graph G = (V, E) each vertex v £ V is mapped to a distinct 
point p v in the plane and each edge (u, v) € E is mapped to a closed simple 
curve that connects the points p u and p v and does not pass through the image 
of any other vertex. If two curves share an interior point p , we say that they 
cross at p. The crossing number cr(G) is the minimal number of crossings in 
any drawing of G. The crossing number problem is the problem of finding the 
crossing number for a given graph G. 

The graphs that can be drawn without any edge crossings are called planar 
graphs. A planar drawing of a graph divides the plane into regions called faces. 
Every drawing defines a planar and a combinatorial embedding of the graph G. 
Such an embedding essentially fixes the topology of the graph. A combinatorial 
embedding is defined as a clockwise ordered list of adjacent neighbors for each 
vertex v £ V. When, in addition, the outer face is fixed, the combinatorial 
embedding is also called a planar embedding of G. An alternative definition of 
a combinatorial embedding is for each face / an anti-clockwise ordered list of 
the edges bordering /. Given a planar graph, a combinatorial embedding can be 
computed in linear time [5,19] . In general, a planar graph can have an exponential 
number of combinatorial embeddings. In the following section, we will use the 
name embedding for planar or combinatorial embedding. 

3 The Planarization Approach 

In practice, the crossing minimization problem is solved heuristically using a 
2-step planarization approach. In a first step, a small number of edges is deleted 
from G = (V, E ) in order to obtain a planar graph P. In a second step, the 
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edges are re-inserted into the planar graph P while trying to keep the number 
of crossings small. 

3.1 Methods for Computing the Planar Subgraph 

For the first step, we need to solve the so-called maximum planar subgraph prob- 
lem , which has been shown to be NP-hard [18]. If the number of edges to be 
deleted is small, the exact branch-and-cut algorithm suggested in [15] is able 
to provide a provably optimal solution quite fast. However, the method is quite 
complicated to understand and to implement: Moreover, if the number of deleted 
edges exceeds 10, the algorithm usually needs far too long to be acceptable for 
practical computation. Since we are interested in approaches for practitioners, 
we did not involve this exact method into our studies. Interested readers are 
refered to the study of Ziegler [25] concerning the number of deleted edges in 
the Rome library benchmark set. 

A widely used standard heuristic for finding a maximal planar subgraph 
is to start with the empty graph, and to iteratively try to add the edges one 
by one. In every step, a planarity testing algorithm is called for the obtained 
graph. If the addition of an edge would lead to a non-planar graph, then the 
edge is disregarded; otherwise, the edge is added permanently to the planar 
graph obtained so far. After \E\ iterations (planarity tests), we have obtained a 
maximal planar subgraph P of G, i.e. , a subgraph of G which will get non-planar 
as soon as any of the edges in G — P will be added. We will denote this method 
as MAXIMAL. The standard (and also our) implementation needs a running 
time of 0(| £'|| VI). Theoretically, this can be improved to nearly linear running 
time using so-called online-planarity testing algorithms (e.g., [1,21]), but we are 
not aware of any correct implementation. 

An alternative to this method is to use the planarization algorithm based on 
PQ-trees suggested in [12,14]. Observe, that this method cannot guarantee to 
derive a maximal planar subgraph. The theoretical worst case running time is 
0(|V| 2 ). In practice it is much faster. 

The quality of the results can be improved by introducing random events and 
calling them several times. The PQ-tree based algorithm starts by computing a 
so-called st-numbering. Our random event was simply to choose a random edge 
of E in order to get s and t. We studied the effects of up to 100 calls. We denote 
these methods as PQ1, PQ10, PQ50, and PQ100 for 1, 10, 50, and 100 iterations. 

3.2 Edge Re-insertion Strategies 

Fixed Embedding. Also the edge re-insertion step is a NP-hard optimiza- 
tion problem [25]. The standard algorithm used in practice re-inserts the edges 
ei, e 2 , ,ek iteratively starting with a given planar embedding of G. The ap- 
proach is based on the observation that an edge ei crosses an edge in P if and 
only if it uses an edge in the geometric dual graph of P. Hence, the problem of re- 
inserting only one edge into P with a given planar embedding can be solved via 
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a simple shortest-path computation in the extended dual graph of P. (We need 
to extend the dual graph in order to connect the end-vertices of with the dual 
graph.) After each insertion step i, the crossings generated by edge e, are substi- 
tuted by artificial vertices so that the resulting graph G U {ei, . . . , e^} becomes 
planar again (i = 1 , ,k). The theoretical worst case running time for inserting 
k edges of our implementation is 0(^*_ 1 (\V\ + X^=o c i)) = 0(fc(|V| + |C|)), 
where Cj is the number of crossings introduced in step j, Cq = 0, and C the 
number of crossings in the final drawing. In practice, it is much faster, since the 
update of the dual graphs are implemented efficiently. We denote this re-insertion 
method as FIX. 

Variable Embedding. However, the quality of the resulting drawing highly 
depends on the chosen embedding for P. In [11], Gutwenger et al. have given a 
linear time algorithm based on the SPQR-tree data structure for inserting one 
edge into a planar graph P so that the number of crossings in PU{e} over the set 
of all possible planar embeddings of P is minimized. Our implementation has the 
same theoretical running time as the variant FIX. We denote this re-insertion 
method as VAR. 

Constrained Crossing Minimization. Obviously, re-insertion of all edges 
at the same time will improve the solution. However, no practically efficient 
algorithm is known. The constrained crossing minimization problem asks for the 
minimum number of crossings obtained by a set of edges F when inserted into 
a planar graph P, while the embedding of P is not changed. The problem has 
been investigated in [20,25]. Experiments show that it can only be solved to 
provable optimality if there are less than 10 re-inserted edges — and even then, 
the running time is relatively high. Therefore, we did not include this method 
into our experiments. 

Post-Processing Strategies. The idea of the post-processing strategies is to it- 
eratively delete an edge from the drawing and to re-insert it again. Our strategies 
vary in the set and/or number of edges involved in the deletion and re-insertion 
process, and the order of processing them. 

The variant INS involves exactly those edges which have been deleted already 
in the planar subgraph step, whereas the variants ALL and MOST involve the 
whole set of edges E in the original graph G. An iteration takes either the 
whole set (in variant INS and ALL) or x% of this edges (variant MOST x%) 
iteratively (one after the other) . The procedure stops only if within one iteration 
no improvement has been made. How do we choose the edges in variant MOST 
x% ? After each iteration, we sort the involved edge set in descending order 
according to the number of crossings they are involved in. Now, only the first 
x% edges of this list are taken for re-insertion. The variant in which there is no 
post-processing routine is called NONE. 

The post-processing procedure can be implemented efficiently by updating 
the dual graph only at those regions, in which changes did occur. We did this for 
our FIX strategy. In principal, such an update is also possible for the VARIABLE 
embedding setting [1]. However, we are not aware of any implementation of this 



An Experimental Study of Crossing Minimization Heuristics 



17 



algorithm. This explains the big running time discrepancy in the post-processing 
procedure between the FIXED and VARIABLE embedding setting. 

Permutations. After a whole deletion and re-insertion process of the cho- 
sen strategy for embedding FIX/ VAR and a strategy for post-processing 
NONE/INS/ ALL/MOST, we get a certain crossing number. Our permutation 
variant does nothing else, but repeating the whole edge re-insertion process and 
keeping the best results. The random effect exists in choosing a different ordering 
of the edges in G — P for the initial re-insertion step. The notation PERMi gives 
the number of these repetition rounds. We have experimented with PERM1, 
PERM2, PERM10, and PERM20. 

4 Experimental Results 

For our computational experiments, we have used our new graph drawing li- 
brary which is the basis of the GoVisual layout tools [10]. Our computational 
experiments ran on a PC with Pentium 4, 2.4 GHz, 512 MB RAM, under 
MS Windows 2000, and our C++-code has been compiled with MS Visual 
C++. NET (Visual C++ 7). We used the benchmark set which is commonly 
known as the Rome library. It contains 11.528 graphs 1 with 10 to 100 ver- 
tices, and has been generated from a core set of 112 graphs used in real-life 
software engineering and database applications. The library is available via 
http : / /www . dia. uniroma3 . it/people/gdb/ wpl2/undirected-l . tar . gz. 

4.1 Results of the Planar Subgraph Computations 

In our first experiments, we measured the average number of deleted edges of the 
five strategies for computing a planar subgraph for each group with vertex size 
i, i = 10, . . . , 100, separately. The results improve significantly as the number of 
permutations increases. While the number of deleted edges went up to 19 (on 
average) for PQ1, it was about 16 for PQ10, and 15 for PQ50 and PQ100. It 
seems that it really pays off to run the planar subgraph algorithm many times. 
However, the results for 50 and 100 permutations are very close to each other. 
This effect comes from the random effect chosen for our implementation (see Sec- 
tion 3.1). By choosing other randomization techniques, this effect may occur at 
a higher number of permutations. Since the running time of our implementation 
is relatively small, it seems that taking 100 iterations of the PQ-based algorithm 
(PQ100) is a good choice. The running time for PQ1 was below 0.002 seconds, 
while the time for MAXIMAL increased to 0.022 seconds at 100 vertices. The 
time for PQ100 was about 0.34 seconds for instances with 100 vertices. 

4.2 Results for the Edge Re-insertion Step 

FIX vs. VAR. Figure 1 shows the number of crossings generated with the 
variants PQ1 vs. PQ100 and FIX vs. VAR. The VAR strategy seems to get 

originally, it were 11,582 graphs, but some of the hies are corrupted 
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better results than the FIX strategy. This effect is outperformed, however, by 
starting with a better planar subgraph for edge re-insertion (i.e. , taking PQ100 
instead PQ1). Because of this, we decided to stay with PQ100 for our further 
experiments. 




Fig. 1 . The effect of the edge re-insertion strategies FIX and VAR. 



On the instances with 100 vertices we get 60.32 crossings on average. This is 
about the same number which was also reported in the study by Di Battista et 
al. [6] . By computing a better planar subgraph the number of crossings reduces 
from 60.32 to 46.97 (about 22% improvement). This number can further be 
reduced to 42.37 by choosing the best embedding for each inserted edge. This is 
already an improvement of about 30%. 

For the remaining Figures, we decided to show the relative improvement of 
the variants acording to the standard method PQ1-FIX-NONE-PERM1, which 
was also used in [6]. 

Post-Processing. Figure 2 shows the effect of the post processing variants 
for the VAR strategy. Taking the inserted edges as candidates for re-insertion 
is already much better than the NONE strategy (no post-processing at all). 
However, the results can be improved a lot more by taking the whole set of 
edges into account. We observe that already taking into account 25% of the edges 
gives similar results to the options MOST100 and ALL. ALL, MOST100, and 
MOST25 improve the number of crossings up to 46.11% (corresponds to 32.5 
crossings at 100 vertices) compared to the standard method. For the FIXED 
strategy, the picture looks very similar. Here, the improvement coming from 
post-processing in comparison to NONE is a little bit better. 

We also measured the running times of the post processing variants. Strate- 
gies INS and MOST10 have about the same running time (up to 0.18 seconds), 
which is up to a factor of 2.5 slower than the NONE variant. MOST25 needs 
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Fig. 2. The effect of the post processing variants compared to the standard method. 



up to 0.35 seconds. Not surprisingly, MOSTIOO and ALL have a similar running 
time (up to 0.53 seconds), and are the slowest variants. For the FIX strategy, 
the running time of all versions is about the same. The time for variant PQ100- 
FIX-MOST100-PERM1 is only 0.01 seconds larger than the time for variant 
PQ100-FIX-NONE-PERM1. This comes from the efficient update operations 
mentioned in Section 3.2. 

Permutations. Figure 3 shows the effect of the permutation variants for FIX. 
As expected, the results improve with a higher permutation number. However, we 
observe that the post-processing effects are stronger than the PERM20 effects: 
all four strategies PQlOO-FIX-MOSTlOO-PERMi lead to much better results 
(between 41.77% and 50.25% improvement at 100 vertex instances) than the 
four strategies PQlOO-FIX-NONE-PERMi (i = 1,2,10,20) (between 22.12% 
and 32.36%). 

Obviously, the running time increases a lot: PERM20 needs about 20 times 
the running time of PERM1 (since the edge re-insertion step dominates the 
procedure). For VAR, the situation is similar. The only difference is that the 
upper three curves are closer to the others. 

PQ1 vs. PQ100. Finally, we have tested the effects of the maximal planar sub- 
graphs heuristics PQ1 and PQ100 again for the best edge re-insertion strategies. 
Figure 4 shows that PQ100-FIX-ALL-PERM20 gives an improvement of 49.55% 
compared to the standard method, whereas PQ 1-FIX- ALL-PERM20 gives an 
improvement of 47.22%. The running times for both variants are about the 
same. Interestingly, this is not true for the strategies PQ100-VAR-ALL-PERM1 
and PQ1-VAR-ALL-PERM1. Due to the better planar subgraph achieved with 
PQ100-VAR-ALL-PERM1, the running times differ by 0,21 seconds at the in- 
stances with 100 vertices (0.48 compared to 0.69). Surprisingly, the PQ100 strat- 
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Fig. 3. The effect of the PERM variants. 




Fig. 4. The relative effect of the maximal planar subgraph variants. 



egy is faster than the PQ1 strategy. The relative improvements in the number 
of crossings are 45.15% and 41.03%, respectively. 

Conclusions. We have run all possible experiments with strategies FIX and 
VAR, the different post-processing strategies NONE and ALL, and the permu- 
tation numbers 1 and 10. Figure 5 shows the results for these eight strategies. 
The best results gives PQ100-VAR-ALL-PER.M10 (51.63% improvement at 100 
vertex instances corresponding to 29.18 crossings) followed by PQ100-FIX-ALL- 
PERM10 (48.42% corresponding to 31.11 crossings). The three strategies us- 
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Fig. 5. The results for a collection of strategies. 
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Fig. 6. The running times for the results displayed in Figure 5. 



ing post-processing ALL are the four winners in this Figure. Interestingly, the 
best method without any post-processing, PQIOO-VAR-NONE-PERMIO, is in 
competition with the strategy PQ100-FIX-ALL-PERM1 (with up to 45% im- 
provement). This means that, in order to be competitive with post-processing, 
10 permutations are not enough. Additional help is needed, here in form of the 
strategy VAR compared to FIX. 

Figure 6 shows the running times for the results displayed in Figure 5. We ob- 
serve that the running time of PQIOO-VAR-NONE-PERMIO is much slower than 
the running time of PQ 100-FIX- ALL-PERM1. Hence PQ100-FIX-ALL-PERM1 
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is the clear winner among these two strategies. The best strategy displayed in 
Figure 6, PQ100-VAR-ALL-PERM10, has the highest running time. Interest- 
ingly, the second best strategy, PQ100-FIX-ALL-PERM10, has the fifth best 
running time (up to 0,14 seconds). 

Table 1. The ranking list of the best heuristics. 



Rank 


Crossings 


Time (sec) 


MPS 


FIX / VAR 


POST 


PERM 


1 


28.56 


8.778 


PQ100 


VAR 


ALL 


PERM20 


2 


28.61 


8.563 


PQ100 


VAR 


MOST100 


PERM20 


3 


28.66 


5.902 


PQ100 


VAR 


MOST25 


PERM20 


4 


29.09 


4.359 


PQ100 


VAR 


MOST100 


PERM10 


5 


29.35 


3.060 


PQ100 


VAR 


MOST25 


PERM10 


6 


30.01 


0.259 


PQ100 


FIX 


MOST100 


PERM20 


7 


30.43 


0.258 


PQ100 


FIX 


ALL 


PERM20 


8 


30.62 


0.130 


PQ100 


FIX 


MOST100 


PERM10 


9 


31.11 


0.128 


PQ100 


FIX 


ALL 


PERM10 


10 


31.64 


0.112 


PQ100 


FIX 


MOST25 


PERM10 


11 


33.16 


0.054 


PQ100 


FIX 


MOST100 


PERM2 


12 


33.29 


0.053 


PQ100 


FIX 


ALL 


PERM2 


13 


34.14 


0.050 


PQ100 


FIX 


MOST25 


PERM2 


14 


35.12 


0.046 


PQ100 


FIX 


MOST100 


PERM1 


15 


35.29 


0.045 


PQ100 


FIX 


ALL 


PERM1 


16 


36.09 


0.043 


PQ100 


FIX 


MOST25 


PERM1 


17 


38.93 


0.040 


PQ100 


FIX 


MOST10 


PERM1 


18 


46.98 


0.036 


PQ100 


FIX 


NONE 


PERM1 


19 


60.32 


0.002 


PQ1 


FIX 


NONE 


PERM1 



Table 1 shows a ranking list of the best obtained results in the following sense: 
We have sorted all strategies according to their average crossing number achieved 
at the instances with 100 vertices. Running through this list, we deleted all those 
strategies, which have obtained worse crossing number and worse running time 
than another strategy on the list. It is interesting that the best five strategies 
are based on VAR, and all of the remaining strategies in our ranking list are 
based on FIX. This stems from the fact, that from a certain point on, the VAR 
strategy takes much more time than the FIX strategy. E.g., the strategy PQ100- 
VAR-ALL-PERM1 has been eliminated from the table through PQ100-FIX- 
MOST100-PERM20. It seems that the effect coming from the permutations is 
stronger than the positive VAR effect, at least until a certain point. 

Figure 7 shows a selection of the best obtained results listed in Table 1. 
The winners are PQ100-VAR-ALL-PERM20 (with improvement of 52.65% cor- 
responding to 28.56 crossings) and PQ100-VAR-MOST25-PERM20, followed by 
PQIOO-VAR-MOSTIOO-PERMIO and PQ 100-VAR-MOST25-PERM10. Only 
then, the FIX strategies appear on the list. 
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Fig. 7. The best obtained results within our study. 



5 Conclusions 

We have conducted extensive experimental studies on the crossing minimization 
problem for a benchmark set of graphs. The main conclusions are: 

1. Post-processing always helps. It is recommended not to restrict the post- 
processing procedure to the inserted edges. Already re-inserting 25% of all 
the edges helps a lot. 

2. Permutations and random effects help, but not as well as post-processing. 

3. It is important to start with a good planar subgraph. A better subgraph 
leads not only to an improved number of achieved crossings, but also to an 
improved running time of the algorithm. 

4. The re-insertion within a variable embedding setting is still worth doing, 
even if post-processing is used. 

Acknowledgement. We thank Sebastian Leipert for providing his implemen- 
tation of the PQ-based planarity testing, planarity embedding, and planar sub- 
graph algorithms. 
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Abstract. In this paper we give a new description of the planarity test- 
ing and embedding algorithm presented by Boyer and Myrvold [2], pro- 
viding, in our opinion, new insights on the combinatorial foundations of 
the algorithm. Especially, we give a detailed illustration of a fundamental 
phase of the algorithm, called walk-up, which was only succinctly illus- 
trated in [2], Also, we present an implementation of the algorithm and 
extensively test its efficiency against the most popular implementations 
of planarity testing algorithms. Further, as a side effect of the test activ- 
ity, we propose a general overview of the state of the art (restricted to 
efficiency issues) of the planarity testing and embedding field. 



1 Introduction 

Testing the planarity of a graph is one of the most fascinating and intriguing 
problems of the graph drawing and of the graph algorithms fields. 

In 1974 Hopcroft and Tarjan [10] proposed the first linear-time planarity 
testing algorithm. This algorithm, also called “path-addition algorithm,” starts 
from a cycle and adds to it one path at a time. However, the algorithm is so 
complex and difficult to implement that several other contributions followed 
their breakthrough. For example, about twenty years after [10], Mehlhorn and 
Mutzel [13] contributed a paper to clarify how to construct the embedding of a 
graph that is found to be planar by the original Hopcroft and Tarjan algorithm. 

A different approach has its starting point in the algorithm presented by 
Lempel, Even, and Cederbaum [12]. This algorithm, also called “vertex addition 
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algorithm,” is based on considering the vertices one-by-one, following an st- 
numbering; it has been shown to be implementable in linear time by Booth and 
Lueker [1], Also in this case, a further contribution by Chiba, Nishizeki, Abe, 
and Ozawa [5] has been needed for showing how to construct an embedding of 
a graph that is found planar. 

A further interesting algorithm is based on a characterization given by de 
Fraysseix and Rosenstiehl [6] and, although it has not been fully described in the 
literature, it has a very efficient implementation in the Pigale software library [7]. 

However, the story of the planarity testing algorithms enumerates several 
more recent contributions, aimed at further exploring the relationships between 
planarity and Depth First Search (DFS) and at devising algorithms that are 
easier to understand and to implement. 

Two recent DFS-based planarity testing algorithms are those presented by 
Shih and Hsu [15,16,11] and by Boyer and Myrvold [2]. Shih and Hsu algorithm 
replaces biconnected portions of the graph with single nodes whose embedding 
is fixed. Boyer and Myrvold algorithm, which is the starting point of this paper, 
represents embedded biconnected portions of the graph with a data structure 
that allows the embeddings to be “flipped” in constant time. More recently, in 
an unpublished manuscript, Boyer and Myrvold [3] presented an algorithm that, 
although inspired by [2], constitutes, in our opinion, a new and original approach 
to the planarity testing problem. 

The contributions of the present paper can be summarized as follows. In Sec- 
tion 3 we describe the algorithm of [2] in a new way, that is, in our opinion, easily 
readable and more suitable for an implementation. In particular, in Section 4 
we give a detailed description of a fundamental phase of the algorithm, called 
walk-up, which is essential for a correct and efficient implementation and was 
only succinctly illustrated in [2]. Finally, in Section 5 we present an experimental 
analysis which provides a general overview of the state of the art (restricted to 
efficiency issues) of the planarity testing and embedding field. 



2 Background 

We assume that the reader is familiar with graph terminology and basic proper- 
ties of planar graphs. Unless otherwise specified, we only consider graphs without 
self- loops and multiple edges, as they can be replaced with paths of length two 
without affecting planarity. 

A cutvertex of a graph G is such that its removal increases the number of 
connected components of G. A connected graph is said to be biconnected if 
it has no cutvertices. The biconnected components of a connected graph (also 
called bicomps) are its maximal biconnected subgraphs. A bicomp is said to be 
elementary if it is formed by a single edge. Note that a cutvertex belongs to 
several bicomps, while each edge falls into exactly one bicomp of the graph. 

A Depth First Search ( DFS in short) is a technique for visiting all the vertices 
of a graph. Each vertex v is assigned a DFS index , denoted DFS(v ), which 
specifies the order in which it was reached by the DFS visit, starting from the 
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root r which has DFS(r) = 1. The edges used by the DFS visit to move from one 
vertex to the next one are called tree-edges and form a spanning tree of G, called 
the DFS tree. The remaining edges are back-edges. The ancestors of a vertex v 
are the vertices in the unique chain of tree-edges from v to the root r. If a vertex 
u is an ancestor of v, then v is a descendant of u. A tree-edge links a parent 
vertex to a child vertex, the former (latter) being the one with lowest (highest) 
DFS-index, while a back-edge is thought to be oriented exiting the descendant 
and entering the ancestor. The lowpoint of a vertex v, denoted by Lowpt(v ), is 
the lower DFS index of an ancestor of v reachable through a back-edge from a 
descendant of v. The set of back-edges entering a vertex v is denoted B in (v), 
while the set of back-edges exiting v is denoted B out (v). For a back-edge e, we 
call support of e and denote by 5(e) the unique chain of tree-edges having the 
same endpoints as the back-edge e. The support of e and e form a cycle. 

A planar drawing of a graph is such that no two edges intersect (except at 
common endpoints). A graph is planar if it admits a planar drawing. A planar 
drawing partitions the plane into topologically connected regions, called faces. 
The unbounded face is called the external face. The boundary of a face is its 
delimiting circuit. The incidence list of a vertex v is the set of edges incident 
upon v. A planar drawing determines a circular ordering on the incidence list of 
each vertex v according to the clockwise sequence of the incident edges around 
v. Two planar drawings of the same connected graph G are equivalent if they de- 
termine the same circular orderings of the incidence lists. Two equivalent planar 
drawings have the same face boundaries. A planar embedding or simply embed- 
ding of G is an equivalence class of planar drawings and is described by circularly 
sorted incidence lists for each vertex v. In the following, unless otherwise spec- 
ified, we will consider the embeddings comprehensive of the specification of the 
external face. In [2] a suitable data structure is introduced which can be used 
to describe an embedded bicomp. This data structure allows to “flip” the bi- 
comp (that is, to represent the embedding in which all the adjacency lists of the 
vertices have been reversed) in constant time. 

Since a graph is planar iff its bicomps are planar, in the following section 
we assume that the graph to be processed is biconnected. A simple method for 
handling graphs that are not biconnected is based on the property, which is easy 
to prove, that a graph G is planar iff it is planar the graph G' obtained from 
G by merging two adjacent bicomps b\ and & 2 , sharing a cut vertex v, with the 
addition of an edge between two vertices adjacent to v, one belonging to b\ and 
the other to 62 - Based on this property, all the bicomps of the graph G can be 
merged with dummy edges without affecting planarity and the dummy edges 
can be removed after the embedding is computed. 

3 The Boyer and Myrvold Planarity Testing and 
Embedding Algorithm 

In this section we give a new description of the Boyer and Myrvold planarity 
testing algorithm [2] . This algorithm tests the planarity of a biconnected graph 
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in linear time by trying to build a planar embedding of it and consists of three 
steps: Preprocessing , Embedding Construction , and Embedding Reporting. 

In Step Preprocessing a DFS is performed on the input graph G. During the 
visit a DFS tree is constructed. For each vertex v we compute DFS{v ), Lowpt(v), 
the set Bi n (v), and the list L{y) of the children of v in the DFS tree sorted by 
their lowpoint values. Also, we compute H(v), that is the lowest DFS(w), with 
w in B out (v). Roughly, H{v ) is the DFS index of the vertex “nearest” to the 
root, reachable from v with a back-edge. If B out (v ) is empty, then H(v) is set to 
an “infinite” value. Further, each edge of the DFS tree is associated with a data 
structure representing an (embedded) elementary bicomp. 

Step Embedding Construction is more complex and is based on visiting one- 
by-one the vertices of the input graph in inverse DFS-index order. Observe that 
this corresponds to a post-order traversal performed on the DFS tree. 

Let v be the current visited vertex. Two strictly related tasks, called walk-up 
and walk-down (both described below), are performed starting from v. Let G(v ) 
be the subgraph of G composed by the edges of the DFS tree augmented with 
the edges in B in (w ), for each w such that DFS(w ) > v. The target of the two 
tasks is to determine (if it is possible) a planar embedding for the bicomps of 
G(v) exploiting the planar embeddings already computed for graph G(u), where 
DFS(u) = DFS(v) + 1, which are preserved up to a “flip” operation. See Fig. 1. 




Fig. 1. (a) A graph G to be planarized; the edges of the DFS tree are drawn thick. 
DFS(u) = DFSfv) + 1. (b) Graph G(u). (c) An embedding of the bicomps of G(u). 
(d) Graph G(v). (e) An embedding of the bicomps of G(v). 



In order to make this possible, the embeddings of the bicomps of G(v ), com- 
puted when vertex v is processed, must have the outer vertices of G(v) on the 
external face. We call outer vertices of G(v) the cutvertices of G(y) and the 
vertices of G(v) incident to a back-edge that is not in G(v). Observe that: (i) 
G(u ) is a subgraph G(v). (ii) The edges belonging to G(v ) that do not belong 
to G(u) are exactly the back-edges in B in (v). (iii) Since the input graph G is 
biconnected, when DFS(v) = 1, G(v) = G has a single bicomp, and a planar 
embedding of G(u), if one exists, is a planar embedding for G. 
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In the Embedding Reporting step, the embedding is copied to a target data 
structure. The rest of the section shows the walk-up and the walk-down phases. 

The purpose of the walk-up phase is to verify if there is a way to add the 
back-edges in Bi n (v) to the embedded bicomps of G{u) in such a way that the 
embedded bicomps of G(v) have the outer vertices of G(v) on the external face. 
The walk-up phase works as follows. For each edge e= (w,v) in B in (v) a path 
p e is searched (and marked) from w to v along the bicomps that contain an 
edge in 5(e). The paths must satisfy several constraints. In order to describe 
these constraints, we need to introduce some definitions. Consider an embedded 
bicomp b of G(u) containing some edges of 5(e). Observe that the edges of 5(e) 
contained in b form a subpath of 5(e). This subpath has its endpoints on the 
external face of b. We call these two vertices root of b and entry point of b with 
respect to 5(e), the former (latter) being the one nearest to (farthest from) the 
root of the DFS tree. 

Given a specific 5(e), two borders A and B can be identified from Vb, e to r t, 
along the boundary of the external face of b. If b is an elementary bicomp, the 
two “sides” of the edge (vb, e ,H) are considered as distinct borders. A vertex w 
in a bicomp b different from the root r b is defined to be externally active for b if 
there is a path from w to an ancestor of the currently being processed vertex v, 
that is only composed by a (possibly empty) sequence of tree edges not belonging 
to b plus a single back-edge. 

Two constraints, called a and /3, must be enforced on each path p e . (a) For 
each bicomp b that contains edges in 5(e) the path p e must contain either the 
border A or B from the entry point Vb >e to the root 77 of b. (/?) A path p e must 
not contain a border that has an inner vertex (i.e. a vertex other than Vb, e and 
77 ) that is externally active. Three more constraints, 7 , S, and e, are expressed 
with respect to each pair of paths p ei and p e2 traversing the same bicomp b. 
We say that the border of b used by p ei intersects the one used by p e2 if they 
share an edge (or if they share the same “side” of the sole edge of the elementary 
bicomp b ) . Non-intersecting paths can only share entry or exit vertices (or both) 
in the bicomps they both traverse. (7) If the border of b used by p ei does not 
intersect the border of b used by p e2 , the two paths must use non-intersecting 
borders in all the bicomps they both traverse. (<5) Paths p ei and p e2 must not 
use intersecting borders of b if they traverse two other distinct bicomps that are 
externally active. We call a bicomp externally active if it contains a (non-root) 
externally active vertex, (e) If the border of b used by p ei does not intersect the 
border of b used by p e2 and the root 77 of b is different from v, then 77 must not 
be an outer vertex of G(v). 

If no set of paths are found satisfying the constraints, then G is not planar [2] , 
otherwise, they are used by the walk-down phase to build the embeddings of 
the bicomps of G(y). Actually, the information used by the walk-down phase 
is simpler than the paths themselves. It only consists of the set of the borders 
of the bicomps used by the paths (that is, the marked borders) and, for each 
vertex w that was a walk-up entry point for a bicomp, of two sets of vertices 
called active roots and non-active roots of w. The set of the active roots of w 
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Fig. 2. Four different configurations for illustrating Constraints /3, 7 , 5, and e. Edges 
in Bin(v) are drawn with dotted lines, while externally active vertices are drawn black, 
(a) An example of a path that does not satisfy constraint j3. (b), (c) Examples of pairs 
of paths satisfying constraints 7 and 5, respectively, (d) An example of a pair of paths 
that do not satisfy constraint e. 



contains the roots (at most two) of the externally active bicomps traversed by 
the paths immediately before entering w. The set of the non-active roots of w 
contains the roots of the non externally active bicomps traversed by the paths 
immediately before entering w. 

Consider two back-edges e\ and e2 in Bi n (v). If the intersection of S'(ei) and 
S(e 2) is not void, we say that ei and e2 are interleaving. It is easy to prove that 
the interleaving relation is an equivalence relation. Roughly, the edges of the same 
equivalence class have the same child of v as an internal vertex of their supports. 
For each equivalence class induced by the interleaving relation, the walk-down 
process starts on a marked border A of the elementary bicomp containing v by 
pushing v onto a stack. When the stack becomes void an analogous process is 
performed on the opposite border B , except if v is the DFS tree root. Let w be 
the current vertex on top of the stack. The back-edge e from w to v (if any) 
is added to the embedding of 63 in such a way to close the border A in the 
internal part of the cycle formed by e and p e . Then, the stack is updated by: 
(i) removing w\ (ii) pushing onto the stack the next marked vertex x along the 
border adjacent to w provided that the set of active roots of w is empty; (iii) 
pushing onto the stack one of the active roots of w (if any); (iv) pushing onto 
the stack all the non-active roots of w. When the current vertex on top of the 
stack is found to be the root vertex of some bicomp &, then b is merged into 
the bicomp b b , flipping its embedding if necessary so that a marked border of b 
attaches to the just processed border of &g. 

4 Selecting Paths: Implementation Issues for the Walk-up 

In order to describe the operations performed during the walk-up phase we need 
to extend to root vertices the definition given in Section 3 for external activity 
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of non-root vertices. We define a root vertex of a bicomp b externally active 
if it exists a back-edge from to the currently being processed vertex v or if it 
exists a bicomp b' ^ b which is externally active and such that tv = r f,. Also, 
we call a vertex w on a bicomp b (w ^ rb) pertinent for b if it is an entry point 
for b and if it is not externally active for b. 

Due to space reasons, some technicalities needed to detect in constant time 
the external activity of vertices and bicomps and the pertinence of vertices are 
left out from this camera ready and can be found in [4] . 

During the walk-up phase, back-edges in B in (v) are considered one-by-one in 
arbitrary order. The operations performed for each back-edge e in Bi n (v), called 
walk-up(e) , have the purpose of marking the borders used by p e and of updating, 
if it is needed, the lists of active roots and non-active roots of the entry point 
Vb t e of each bicomp b traversed by p e . Exceptionally, walk-up(e) may reverse the 
decision taken by a previous walk-up(e / ) about which border A or B of a bicomp 
b is used by p e i. Walk-up(e), where e = (w,v), starts at vertex w and selects a 
path to v that passes through each bicomp b traversed by 5(e). It terminates 
if (i) vertex v is reached, (ii) a vertex u , marked by a previous walk-up(e / ) is 
reached and the subpath of p e from u to v is chosen to be equal to the analogous 
subpath of p e > , (iii) a non-planarity condition has been detected. On each bicomp 
b traversed by 5(e), walk-up(e) proceeds from the entry point Vb, e along both 
borders, A and B. in parallel, searching for the root rb of the bicomp or for a 
marked vertex. Borders blocked by externally active vertices can not be used. 
When the root rb is reached, the fully explored border is selected and marked, 
while the partial exploration of the opposite border is abandoned, and walk- 
up(e) “ascends” to the next bicomp traversed by 5(e), that is, starts a parallel 
exploration of its borders. When walk-up (e) begins on the first bicomp traversed 
by 5(e), it is said to be internally active. It changes to externally active when it 
starts a parallel exploration of the borders of a bicomp after ascending from an 
externally active one. In order to be able to mark the borders of a bicomp b , we 
associate with each vertex u on the external face of b two pathlnfo integers, one 
for each edge incident to u on the external face of b. During Step Preprocessing, 
pathlnfo integers are assigned a value greater than n, where n is the number of 
vertices of the input graph. The flags are considered to be clear if their absolute 
value exceeds DFS(v), where v is the currently being processed vertex, while 
are considered to be set if their absolute value is equal to DFS(v). Thus, when 
the algorithm moves to processing the next vertex, which has lower DFS index, 
the pathlnfo flags are implicitly cleared. While walk-up(e) is internally active, it 
sets the pathlnfo integers to —V, where V = DFS(v). Once walk-up(e) becomes 
externally active, it sets pathlnfo integers to +V. 

The following is a case-by-case statement of the decision points: first we give 
six rules for an internally active walk-up, then we provide the analogous six rules 
for an externally active walk-up. 

Rule 1. If an internally active walk-up(e) enters Vb, e an d either pathlnfo 
integer is set, then walk-up(e) terminates (the remaining part of p e being already 
marked properly by a prior walk-up) . 
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Rule 2. If an internally active walk-up(e) enters Vb, e and both pathlnfo 
integers are not set, then walk-up(e) traverses both borders A and B in search 
of the root r f,. (a) If a vertex w with a pathlnfo set to —V is encountered, then 
set the pathlnfo to —V along the border traversed from Vb, e to w and terminate 
walk-up(e). (b) If a border is blocked (i.e. if it has an internal vertex that is 
externally active), then the opposing border must be taken to reach the root rf,. 
(c) If the second border is also blocked, then the graph is non-planar. 

Rule 3. If an internally active walk-up(e) reaches vertex v then it terminates, 
and the pathlnfo of the border used to reach v are set to —V. 

Rule 4. If an internally active walk-up (e) traverses border A to reach the 
bicomp root r&, then at least one pathlnfo integer of rb must be clear. If both 
are clear, then walk-up (e) simply sets the pathlnfo to — V along A and then it 
ascends to the next bicomp traversed by 5(e). 

Rule 5. If an internally active walk-up (e) traverses border A to reach the 
bicomp root n,, and the pathlnfo of for the opposing border B is set to — V, 
then: (a) If B has no internal vertex that is externally active, then walk-up(e) 
must select B and set to —V the pathlnfo integers along B (until an edge marked 
— V is reached). Then, walk-up(e) can terminate, (b) If B contains only one non- 
root externally active vertex w ^ vi , )e , and if B contains no pertinent vertices 
between w and rb (endpoints excluded), then the prior —V marking of the path 
in B from w to rb can be reversed to be a — V path from (w , . . . , v e ^b, ■ ■ ■ , rb)- 
The pathlnfo of the path in B from w to rb can be cleared and walk-up(e) 
terminated, (c) If B has an internal vertex w that is externally active, but either 
u e ,b is externally active or B contains an internal vertex other than w that is 
pertinent or externally active, then the prior —V pathlnfo markings in B cannot 
be reversed. The current walk-up must select the border A and set its pathlnfo to 
—V. Since rb has been approached from both sides, the graph is non-planar if rb 
is externally active. Otherwise, walk-up(e) ascends to the next bicomp traversed 
by 5(e) (where it becomes an externally active walk-up). 

Rule 6. If an internally active walk-up (e) traverses border A to reach the 
bicomp root r b , and the pathlnfo of for the opposing border B is set to +V, 
then we must test whether the prior externally active walk-up’s path selection 
must be reversed, (a) Let w denote the vertex with a +V pathlnfo setting that 
is most distant from rb in B. If B contains any internal vertex other than w that 
is pertinent and has a +V pathlnfo setting, then there is no need to reverse the 
prior +V path selection since the back-edge ( w,v ) that will be added during 
walk-down will close a pertinent vertex in an interior face of the embedding 
regardless of which border is selected, (b) If the path (vb, e> ■ ■ ■ ,w) in B contains 
an externally active vertex x (other than w), then u blocks the prior +V path 
selection from being reversed, (c) If the prior +V path cannot be reversed, then 
the current walk-up simply sets the pathlnfo to — V on the border A. If rb is 
externally active, then walk-up(e) terminates because the graph is non-planar. 
Otherwise, walk-up(e) ascends to the next bicomp traversed by 5(e) (where it 
becomes an externally active walk-up), (d) If the prior +V path selection can 
be reversed, then the +V pathlnfo integers in B are cleared, while the pathlnfo 
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integers along the path (w , . . . , Vb^t ■ ■ ■ ,n) are set to +V. Then, walk-up(e) can 
be terminated (the remaining part of p e being already marked properly by a 
prior walk-up) 

An externally active walk-up(e) must follow the six rules given below. 

Rule 7. If the entry point Vb, e of an externally active walk-up(e) has both 
pathlnfo integers set to +V, then this is the third externally active walk-up to 
enter vi he and the graph is non-planar. Note that a single prior walk-up cannot 
have passed through Vb. e setting both of its pathlnfo integers to +V because Vb, e 
is externally active for bicomp b. 

Rule 8. If the entry point Vb, e of an externally active walk-up(e) has one 
pathlnfo integer set to +V, then walk-up(e) must select the opposing border A 
to the root (a) If A is blocked by an externally active vertex, then the graph 
is non-planar. (b) If A is not blocked, then walk-up(e) sets its pathlnfo integer to 
+V up to rb . If rb is externally active, then the graph is non-planar. Otherwise, 
walk-up(e) ascends to the next bicomp traversed by 5(e). 

Rule 9. The case in which an externally active walk-up(e) enters a vertex 
Vb t e that has both its pathlnfo integers set to —V can not occur. In fact, prior 
internally active walk-ups entering Vb, e would select the same path, so only one 
pathlnfo would be set to —V. Also, an internally active walk-up cannot pass 
through Vb, e because it is externally active. 

Rule 10. If an externally active walk-up(e) reaches vertex v then it termi- 
nates, and the pathlnfo of the border used to reach v are set to +V. 

Rule 11. If an externally active walk-up (e) enters a vertex Vb, e that has one 
pathlnfo integer clear and the other set to —V, then the marked border leading 
to the root rb must be selected and the —V settings along it changed to +V. 
Then, walk-up(e) ascends to the next bicomp traversed by 5(e). 

Rule 12. If an externally active walk-up(e) enters a vertex Vb, e whose 
pathlnfo integers are both clear, then walk-up(e) traverses A and B in par- 
allel to find rb by the shortest border that is not blocked by an externally active 
vertex, (a) If both borders to are blocked by externally active vertices, then 
the graph is non-planar. Otherwise, let A denote the shorter unblocked border 
to rb . (b) If the pathlnfo in rb is set to — V for the opposing border path B 
and if B is not blocked, then walk-up(e) must select B , set its pathlnfo to +V, 
and ascend to the next bicomp traversed by 5(e). (c) If the pathlnfo in rb is set 
to —V for the opposing border path B but B is blocked, then walk-up(e) must 
select A and set its pathlnfo to +V. If is externally active, then the graph 
is non-planar. Otherwise, walk-up(e) ascends to the next bicomp traversed by 
5(e). (d) If the pathlnfo in rb is set to +V for the opposing border path B , then 
walk-up(e) must select A and set its pathlnfo to +V. If Tf, is externally active, 
then the graph is non-planar. Otherwise, walk-up(e) ascends to the next bicomp 
traversed by 5(e). (e) If the pathlnfo in rb is clear for the opposing border B, 
then walk-up(e) must select A , set its pathlnfo to +V, and ascend to the next 
bicomp traversed by 5(e). 

Therefore, the following Theorem can be stated. 
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Theorem 1 Let G be a graph with n vertices, and let Vi be the vertex with 
DFS(vi) = i, for i = 1, . . . , n. Graph G is planar iff the walk-up phase for vertex 
Vi, with i = n, , 1, selects a path from w e to Vi for each edge e = ( w e , vf) in 
B in (c’i ) . 

5 Experimental Analysis 

In this section we compare our implementation (in the following called BM- 
GDT) with a selection of other planarity testing and embedding algorithm im- 
plementations, providing an assessment of the state of the art in this field. 

All the tests have been performed on a 2.4GHz Pentium IV personal computer 
with 1GB of RAM. 

We generated the test suites of graphs with Leda [14] and Pigale [7] gener- 
ators, which in addition to be well known and widely used, are both fast and 
rigorously built. For each test suite we generated graphs ranging from 10,000 
to 100,000 vertices, increasing each time by 5,000 vertices, 10 graphs for each 
type, with the exception of the test suite labeled Planar-Conn-P, for which we 
generated graphs ranging from 20,000 to 200,000 edges, increasing each time by 
10,000 edges, 10 graphs for each type. 

Planar-L: planar graphs generated by LEDA. The graph are not necessarily 
connected. The number of edges is twice the number of vertices. Max-Planar-L: 
maximal planar graphs generated by LEDA, hence connected and with 3n — 6 
edges, where n is the number of vertices. Non-Planar-L: the same graphs as 
Max-Planar-L in which one edge was added between two non-adjacent vertices. 
Random-L: random graphs generated by LEDA. The number of edges is chosen to 
be two times the number of vertices. The graph are not necessarily connected and 
may be planar. Planar-Conn-P: planar connected graphs generated by Pigale. 
After the generation, the graphs were processed in order to eliminate multiple 
edges and self-loops. Random-P: random graphs generated by Pigale. The number 
of edges is chosen to be two times the number of vertices. The graph are not 
necessarily connected but may be planar. 

We tested the performance of the implementation of BM-GDT against: 
LEC-L Lempel, Even, and Cederbaum algorithm [12], LEDA implementation; 
LEC-GTL Lempel, Even, and Cederbaum algorithm [12], GTL implementa- 
tion [9]; HT-L Hopcroft and Tarjan algorithm [10], LEDA implementation. 
FR-P de Fraysseix and Rosenstiehl algorithm (unpublished, see [6] for basic 
principles), Pigale implementation; BM2 Boyer-Myrvold (new algorithm, un- 
published see [3]), implementation by the first author. All the implementations 
are in C+- 1- but for BM2, which is implemented in C. The implementation of 
BM-GDT uses the data structures for representing graphs of the GDToolkit 
library [8], based, in turn, on Leda. Each algorithm implementation extracts the 
graph from a file and constructs its own data structure. The timer starts after 
the file has been completely read and stops when the test fails (non-planar) or 
when the data structure with the embedding has been constructed (planar). 
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All the implementations gave the same results as for the planarity or non 
planarity of the graphs. The experiments (see Fig. 3) reveal that most of the 
algorithms have amazingly short computation times. Even the slowest imple- 
mentations tested for planarity and, in case, constructed an embedding of the 
graphs in less than 100 /i sec. per vertex 



— • — BM-GDT 
— — - BM2 

- - FR-P 

HT-L 

— — LLC-L 

LbC-CT'L 



; 4- “■? ' 



mono 20000 30000 10000 50000 tioooo 70000 roooo <kxicio 1 00000 

(a) Planar -L 




(b) Hax-Planar-L 





(c) Not-Planar-L 



(d) Random-L 



iim gut 

II M2 
FR-P 
I IT I. 
LBC.-L 
LBC-GTL 



20000 40000 6UUUU 80000 100000 l 20000 140000 16UO0O18OOOO2OOO0O 

(c) Planar-Conn-P 



IIM GUT 

BM2 

FR-P 

llll. 

T.RC-T. 

LRG-GTT. 



0 - 1 * 1 — 1 1 1 « — 

1UU0O 20000 3000U 40000 50000 60000 70000 80000 <KX)IX> 100000 

(f) Random-P 



Fig. 3. The running times (seconds) of the tested algorithms against the test suites 
generated by Leda and Pigale. 



FR-P, BM-GDT and BM2 had similar performance profiles. Much slower 
are LEC-L, LEC-GTL, and HT-L. Also, we were unable to produce an output 
for algorithm LEC-GTL when very big non-planar graphs were involved, be- 
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cause of premature terminations of the program. In addition to connected planar 
graphs (Fig 3.e), we also tested the algorithms against planar biconnected and 
triconnected graphs generated by Pigale with analogous parameters and ranges, 
obtaining similar results. 

We have to remark that some of the implementations rely on data structures 
for graphs representations that are “general purpose” and hence are not specif- 
ically tailored for quick planarity testing, especially when very large data sets 
are involved. 
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Abstract. The fc- planar crossing number of a graph is the minimum 
number of crossings of its edges over all possible drawings of the graph 
in k planes. We propose algorithms and methods for fc-planar drawings 
of general graphs together with lower bound techniques. We give exact 
results for the fc-planar crossing number of K%k+ i,q, for fc > 2. We prove 
tight bounds for complete graphs. 



1 Introduction 

Let cr(G) denote the standard crossing number of a graph G, i.e. the minimum 
number of crossings of its edges over all possible drawings of G in the plane. For 
fc > 2, define the fc-planar crossing number as 

cr fc (G) = min{cr(Gi) + cr(G 2 ) + ... + cr(G fe )}, 

where the minimum is taken over all edge disjoint subgraphs Gj = ( V,Ei ), 

* = 1,2, ..., fc, so that E = Ei U E 2 U ... U Ek- 

Motivated by printed circuit boards, Owens [9] introduced the biplanar cross- 
ing number of a graph G, i.e. the case fc = 2. He described a biplanar drawing of 
the complete graph K n with cr 2 ( K n ) < 7n 4 /1536 + 0(n 3 ). A survey on biplanar 
crossing numbers is in [5]. Determining cr k(G) has application to the design of 
multilayer VLSI circuits [1]. 
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Much of this paper extends ideas of the papers [5] and [12] investigating the 
biplanar crossing number to the fc-planar crossing number. Section 2 gives general 
bounds for the fc-planar crossing number and exposes an important extremal 
problem: how does crfc(G) decrease when fc increases? 

Section 3 yields unexpected exact results for the fc-planar crossing number 
of some complete bipartite graphs. Complete bipartite graphs K p>q are also the 
best studied graphs with respect to planar crossing numbers. Exact results are 
known only for p < 6 and arbitrary g, [6]. Crossing numbers of bipartite graphs 
drawn on surfaces of higher geni were determined only for p < 3, and arbitrary 
q, [10]. Thus our results belong to the same rare class of exact results on crossing 
numbers (for bipartite graphs), and are direct extensions of the results of [5] for 
cr 2 (K$,n) and cr 2 (AT6 >n ). We spell out the results in more details. Recall that 
the thickness 6(G) of G is the minimum number of planar graphs whose union 
is G. By definition, cr k(G) = 0 if and only if 9(G) < k. Beineke et al. [4] proved 
that the thickness of K p q is given by 



®(Kp,q) 



pq 

2(p + q-2) 



(1) 



except, possibly, when p < q are both odd and there exists an integer k such 
that \(p+ 5) < k < \(p — 3) and q = [2 k(p — 2 )/(p — 2k ) According to (1) 
cr k(K 2k ,q) = 0, for k > 2 and any q, so the first interesting bipartite graph is 
K 2 k+i,q- We prove that for k > 2,q > 1 



cr& (I\-2k+l,q) 



q 

2k(2k - 1) 



^ q — k(2k — 1) 



g 

2k(2k - 1) 



k(2k 




and for k >2, and 1 < q < 4 k 2 



C^k(E 2 k+2^q) — 2 




g 

-2k 2 - 




Section 4 improves on the general bounds for the fc-planar crossing numbers 
of complete and complete bipartite graphs. The improvement means constant 
multiplicative factors. 



2 General Bounds 

Little is known about lower bounds for the fc-planar crossing number in general. 
Some of the lower bounds for crossing numbers, rrmtatis mutandis apply to fc- 
planar crossing numbers. For example, if G = (V,E), \V\ = n, \E\ = m, then 
the lower bound resulting from Euler’s formula, cr(G) > to — 3n + 6 for n > 3, 
generalizes to 

crfc(G) > to — fc(3n — 6). 

There is a strengthening of the lower bound resulting from Euler’s formula for 
graphs G with girth g, cr (G) > m — g(n — 2)/(g — 2) for n > g: and we get 

cr fc (G) > to (n — 2). (2) 

g - 2 
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We state a fc-planar version of Leighton’s Lemma [7] for crossing numbers 
(note that we clo not go for the best constants here, since the best constant is 
always getting improved even for the ordinary crossing number) . 

Lemma 1. For a simple graph G with n vertices and m edges, we have m < 6kn, 
or 

(3) 

Proof. Recall Leighton’s Lemma for the ordinary crossing number: m < 4n or 
cr(G) > ?n 3 /64n 2 . Consider an optimal fc-planar drawing of G , such that Gi is 
the subgraph drawn on the i th plane. Assume that the first x graphs have at 
most 4 n edges, while the last k — x graphs have more. We have 

cr fc (G) > £ cr(Gi) > £ ^ > 

i = x -\- 1 i = x -\- 1 

> fc - X ( Yh=x+ i m i \ y, 1 ( m - 4nx) 3 ^ 1 m 3 

— 64n 2 l k — x ) ~ 64 n 2 (fc — a;) 2 — 64 n 2 fc 2 ’ 



where the last inequality holds for in > 6 fcn according to the sign of the deriva- 
tive. □ 

Recall that a(G), or arboricity of G, is the minimum number of acyclic sub- 
graphs whose union covers E. By a well known theorem of Nash-Williams [8] 



a(G) 



max 

HCG 



m(H) 
<H) - 1) 



where the maximum is taken over all subgraphs H of G, with m(H) edges and 
n(H) vertices. It is easily seen that a(G) > 0(G), moreover, 0(G) > [a(G)/3], 
since m(H) < 3?r — 6 for any planar graph. 

Let P = {Vi , V- 2 , ...Vt} be a partition of V. We denote by Eij the set of edges 
with one end point in Vi and the other in Vj, hence E-a denotes the set of all 
edges with both end points in V), for 1 < i < t. Let H denote the t vertex 
graph that is obtained by contracting all vertices in V, into one single vertex 
and removing the multiple edges. We call H the mate of G with respect to P, 
or simply the mate of G. Let T\,T 2 , be a decomposition of the edge set 

of H into acyclic subgraphs of H. Let di(x) denote the degree of a: £ V(H) in 
Ti, i = 1,2, ...fc. 



Theorem 1 . Let G = (V, E), and let k be a given integer. Let {Vi, V 2 , ...V t } be 
a partition of V and let H = (V (H),E(H)) denote the mate of G. If k > a(H), 
then we can construct in polynomial time a k-planar drawing of G with at most 



k 

tp 2 + 2pq\E(H)\ + J2p 2 ( x ) 

i= 1 xeV(H) 



crossings , where p = max{|Pjj|} and q = max{ | E,j \ } , i,j = 1, 2, ...t. 
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Proof Sketch. Consider a drawing of each 7), i = 1, 2, k in plane i, with no 
crossings, so that the vertices are placed in the corners of a convex polygon, and 
each edge is drawn using one straight line segment. Now, replace each vertex 
j £ V(H) with the set Vj. In particular, place the vertices of Vj in a very small 
neighborhood around j. Next, draw the edges in E with straight line segments 
using the drawings of T,’s, to produce a fc-planar drawing of G. There will be 3 
kinds of crossings: 

(a) between edges of Ea, 

(b) between edges of Ea, and edges of E t j , i ^ j, and finally 

(c) between edges Eij, where i,j = 1, 2, ...k and i ^ j. 

The terms in the theorem correspond to these 3 cases. Note that the estimate 
for (b) is pqJ2i=i T, xeV (H) d i ( x ) = 2p<l\E(H)\. □ 

Theorem 1 can be used effectively, if the degrees appearing in the last term 
are small. In fact, in certain cases one can decompose G into a number of (cyclic) 
outer planar graphs of small maximum degree, and still use the method of Theo- 
rem 1 to obtain upper bounds for crfc(G). In this paper, we have obtained exact 
values of cr k{G) for certain graphs in this way. Nonetheless, the acyclic decom- 
positions into forests of small maximum degree has been also studied. Let a,d{G) 
denote the degree bounded arboricity, that is the minimum number of forests 
that the edges of G can be decomposed to so that the maximum degree of each 
forest is bounded by d. Truszczynski [14] conjectured that for every multigraph 
G and d> 2, 

/ A{G)/d or 1 + A(G)/d if a(G) = A(G)/d, 
d \ max (a(G), \A(G)/d\) otherwise. 

Truszczynski actually proved his conjecture for complete and complete bipartite 
graphs, and also for the case d > A(G) + 1 — a(G). Combining Theorem 1 with 
(4), we immediately obtain 

Corollary 1. For n> 1 cr k(K n ) = 0(n 4 /k 2 ). 



However, Corollary 1 also follows from the next theorem: 
Theorem 2. For any graph G on n vertices and m edges, 



cr fc(G) < 



1 

12fc2 




m 2 + O 




The corresponding drawing can be found in polynomial time. For any graph G, 



crfc(G) < 



2cr(G) 
fci°g 2 f 



2cr(G) 
£ 1 . 4708 ... ’ 



Proof. The first upper bound follows from our paper [11] (Corollary 3.2) and a 
simple observation that a drawing of a graph G in 2k pages gives a drawing of 
the graph G in k planes. The second upper bound follows by iteration from the 
inequality cr 2 (G) < |cr(G), proved in [5]. □ 

One challenging question is how cr*, changes from cr(G) to 0, as k increases 
from 1 to the thickness of G, 9(G). 
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3 Exact Results 



Theorem 3. For k > 2, q > 1 



Cr k{-^2k+l,q) 



q 

2k(2k — 1) 



^ ' q — k(2k — 1) 



2k{2k — 1) 




(5) 



Proof. Upper bound. Beineke [2] proved that the thickness of K 2k +i,2k(2k-i) 
is k by describing a drawing of K 2 k+i, 2 k( 2 k-i) in k planes without crossings. 
We extend this drawing to a drawing of K 2k+ i q in k planes with minimum 
number of crossings. Let u\,u 2 , ■■■,u 2 k+i be the vertices of the first partition. 
Let i’i,v 2 , ..., t>2fc(2fc-i) the vertices of the second partition. Beineke’s drawing 
possesses the following properties. 

1. On every plane, all Vi s lie on the vertices of the regular 2k(2k — l)-gon. 

2. All iij ’s lie inside or outside of the polygon. 

3. The edges do not cross. 

4. For every Vi, its degree on exactly one plane is 3 and 2 on the remaining 
(. k — 1) planes. Moreover, on every plane, the vertex ty has a neighbor inside 
and a neighbor outside the polygon. 



Fig. 1 shows the case k = 3, i.e. a drawing of in 3 planes without crossings. 
The graphs on the same row are drawn on the same plane. The left (right) graph 
corresponds to the inside (outside) part of the drawing on a plane. 

Now consider K 2k + i, q and assume that q = 2k(2k — l)a + b, where a,b are 
integers and 0 < b < 2k(2k — 1). Partition the q- vertices into 2k(2k — 1) almost 
equal sets Si, S 2 , ..., S 2k ( 2k _ip where 2k(2k — 1) — b sets have a vertices, and b 
sets have a + 1 elements. On every plane, replace each vertex Vj by the set Sj 
such that its vertices lie on a very short arc and the arcs do not interfere. Join 
every vertex Si to all vertices of Sj on a plane iff u i: was adjacent to Vj on that 
plane in the Beineke’s drawing. Clearly the total number of crossings is 




The above sum turns to 
fl + lA 



which gives the claimed value by substituting a = [q/(2k(2k — 1))J and b = 
q — 2k{2k — 1 )a. 

Lower bound. We will proceed by induction on q. The claim is obviously true 
for q < 2k(2k — 1). The claim is also true for 2k(2k — 1) < q < Ak(2k — 1) as the 
RHS of (5) equals q — 2k(2k — 1), which is a lower bound given by (2). Hence 
assume that the claim is true for some q > Ak(2k — 1). Using the counting argu- 
ment with H = K 2k+ i q , G = K 2k+ i tq+ i, i.e. counting the number of crossings 



+ (2k{2k - 1) - b) = a(b + k{2k - l)(a - 1)), 
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- G +1 - fc(2fc - 1) ([^T)J +1 )) 

“ q - 1 2k(2k - 1) ( 9 ~~ k ^ 2k _ ^ ( 2fc(2fc - 1) + 1 ) ) 

2k(2k - 1) ( 5 + 1 ~~ k ^ 2k ~ ^ ( 2k(2k - 1) + 1 ) ) 

To conclude the proof, it is sufficient to show that for q > 4/c(2fc — 1) the 
expression inside the big brackets of the last line is greater than —1. Let q = 
2k(2k — l)a + b, as above. Distinguish two cases. 

If b < 2k(2k — 1) — 1 then the expression inside the big brackets equals 

— — ^ a(q — k(2k — l)(a + 1)) — a(q + 1 — k(2k — l)(a + 1)) = — > —1. 

If b = 2k(2k — 1) — 1 then the expression inside the big brackets equals 

— a(q — k(2k — l)(a + 1)) — (a + 1 )(q + 1 — k(2k — l)(a + 2)) = 0. 

9-1 

□ 

Theorem 4. For k > 2 

er k {K 2k+2 , q ) < 2 [^2 J (q - k 2 [^2 J - fc 2 ) • (6) 

The equality holds for 1 < q < 4 k 2 . 

Proof. Upper bound. We start with a drawing of K 2k+2 2k 2 in k planes without 
crossings and then extend this drawing to a drawing of K 2k+2 q . Denote the 
vertices of the first partition class by ui,u 2 , ...,u k + 1 and v\,v 2 , ...,v k +\. Denote 
the vertices of the second partition class by cio,a 2 , a k 2 _ 1 and bo, 61, b k 2 _ 1 . 

On the first plane, place the vertices u\, u 2 , ..., u k +\ (resp. vi,v 2 , ...,v k +i) 
on the positive (resp. negative) part of the x axis, in this order from the ori- 
gin. Place the vertices ao, ai, a k 2 -i (resp. b 0 , bi, ..., b k 2 _ 1 ) on the positive 
(resp. negative) part of the y axis, in this order from the origin. Join iq and 
Vi to a (i _ 1 )( fc _ 1 ), ..., a ik -i and , ..., b ik -i, for all i. On the second plane, 

the positions of rq’s and v^s remain unchanged. Shift aj ’ s (resp. bj' s) cycli- 
cally up (down) by k position. Join Ui and Vi to a(i-2)(fc-i)) •••> a (i-i)(fe-l) an d 
fyi— 2)(fc— 1)> •••> b(i-i)( k -i), where the indices are computed modulo k 2 . Contin- 
uing in this drawing for all planes we get a drawing of K 2k+2 2k 2 in k planes 
without crossings. See Fig. 2 for the case k = 3. 

Now consider K 2k+2 ^ q . Partition the q vertices into 2 k 2 almost equal sets, 
Aq, Ai, A k 2 _ 1 and Bo, B\, B k 2 _ 1 . Replace every aj and bj by Aj and Bj 
and join iq and vt to Aj and Bj on a plane iff u, and Vi were adjacent to aj and 
bj on that plane. A simple counting shows that the number of crossings is 
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Fig. 2. A drawing of Kg, is in 3 planes without crossings. 



The rest is similar to the proof of Theorem 3. 

Lower bound. As 0(K 2 k+2,2k 2 ) = k, from (1), crk(K 2 k+ 2 ,q) = 0, for q < 2k 2 . 
Assume 2 k 2 < q < 4k 2 . In this interval the RHS of (6) equals 2 q — 4 k 2 , which is 
the lower bound given by (2). □ 
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4 Improved Bounds on Complete and Complete Bipartite 
Graphs 



4.1 Lower Bounds 

For specific graphs we can strengthen the lower bound by the standard counting 
argument. 

Theorem 5. For p > 6k — 1 and q > max{6fc — 1, 2k 2 } 

cr k(K P , q ) > 3( 3/ . _ ( 2 ) (2) ' 

Proof. The estimation (2) gives 

cr fc (/<6fc-i, 6 fc-i) > 12 k 2 - 4k + 1. 



Using the counting argument with H = I\Qk-i,6k-i and G = K p q we have 



cr k(K p , q ) > [p-a 1 ! [ 6 q^\ crfc(^ 6 fc-i,6fc-i) > 

\6fc— 3/ \6fc— 3/ 



3(3fc - l) 2 



Theorem 6. For n > 2k 2 + 6fc — 1 



crfc( 4 f„) > 2(3^ — 1)2 (4)' 

Proof. Let n = p + q. Combining the counting argument with F[ = K p q and 
G = K n with the lower bound from Theorem 8 we get the claim. □ 



4.2 Upper Bounds 

For special values of k we can improve on the upper bounds from Corollary 1 
and Theorem 2. 

Theorem 7. Let k — 1 be a power of a prime. For n> (k — l) 2 

w {n+k2)A - 

Proof. To appear in the full verion. 

For the complete bipartite graphs and arbitrary k we can extend the con- 
struction from the Section 3. 

Theorem 8. For p>2k + 2 and q > 2 k 2 

cr k (K Ptq ) < A^±^(p +2 fc+l) 2 ( (Z + 2fc 2 -l) 2 . 

Proof. To appear in the full version. 
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Abstract. Using results from extremal graph theory, we determine the 
asymptotic number of string graphs with n vertices, i.e., graphs that can 
be obtained as the intersection graph of a system of continuous arcs in 
the plane. The number becomes much smaller, for any fixed d, if we 
restrict our attention to systems of arcs, any two of which cross at most 
d times. As an application, we estimate the number of different drawings 
of the complete graph K n with n vertices under various side conditions. 



1 Introduction 

Given a simple graph G, is it possible to represent its vertices by simply con- 
nected regions in the plane so that two regions overlap if and only if the cor- 
responding two vertices are adjacent? In other words, is G isomorphic to the 
intersection graph of a set of simply connected regions in the plane? This de- 
ceptively simple extension of propositional logic and its generalizations are of- 
ten referred to in the literature as topological inference problems [CGP98a], 
[CGP98b],[CHK99]. They have proved to be relevant in the area of geographic 
information systems [E93], [EF91] and in graph drawing [DETT99]. In spite of 
many efforts [K91a], [K98] (and false claims [SP92], [ES93]), until very recently 
no algorithm was known for their solution. Two years ago, we showed [PT02] 
that the problem is decidable. Shortly after a more elegant proof was found by 
Schaefer and Stefankovic [SSOla], who went on proving that the question is in 
NP [SSOlb]. 

Since each element of a finite system of regions in the plane can be replaced 
by a simple continuous arc ( “string” ) lying in its interior so that the intersection 
pattern of these arcs is the same as that of the original regions, it is enough 
to restrict our attention to string graphs, i.e., to intersection graphs of planar 
curves. As far as we know, these graphs were first studied in 1959 by S. Benzer 
[B59], who investigated the topology of genetic structures. Somewhat later they 
were also considered by F. W. Sinden [S66] in Bell Labs, who was interested in 
electrical networks realizable by printed circuits. Sinden collaborated with R. L. 
Graham, who popularized the notion among combinatorists at a conference in 
Keszthely (Hungary), in 1976 [G78]. Soon after G. Ehrlich, S. Even, and R. E. 

* Janos Pach has been supported by NSF grant CR-00-98246, PSC-CUNY Research 
Award 62450-0031 and OTKA-T-032452. Geza Toth has been supported by OTKA- 
T-032452 and OTKA-T-038397. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 47-58, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 
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Tarjan [EET76] studied string graphs (see also [K83] and [EPL72] for a special 
case). The aim of this paper is to estimate the number of different string graphs 
on n vertices. 

To formulate our main result precisely, we have to agree on the terminology. 
Let G be a simple graph with vertex set V(G) and edge set E(G). A string 
representation of G is an assignment of simple continuous arcs to the elements of 
V(G) such that two arcs cross each other if and only if the corresponding vertices 
of G are adjacent. Graph G is a string graph if it has a string representation. 
We assume that any two arcs share only finitely many points and that at each 
common point the arcs properly cross, i.e., one arc passes from one side of the 
other arc to the other side. An intersection point of two arcs is called a crossing. 

For any d > 0, graph G is a string graph of rank d if it has a string represen- 
tation with the property that any two strings have at most d crossings. 

A class V of labeled graphs, which is closed under isomorphism, is said to 
be a property. A property V is called hereditary if every induced subgraph of 
every member of V belongs to V. Let V n denote the set of all (labeled) graphs 
on the vertex set {1, 2, . . . , n} that belong to V. In the combinatorics literature, 

the function \V n \ < 2® is often called the speed of property V, and there are 
several well known estimates on its growth rate as n increases. 

Let S and Sd denote the classes of all string graphs and all string graphs 
of rank d , respectively. Clearly, these are hereditary properties and we have 
5 iC5 2 C • • • C S. Our first goal is to estimate their speeds. 

Theorem 1. For the number |«S” | of all string graphs on n labeled vertices, we 
have 

2?(S) < |S n | < 2(i +o(1) )(2). 



Theorem 2. For any d > 0, the number \S^}\ of all string graphs of rank d 
satisfies |SJ| < 2°^. 

We do not have any better lower bound on |<S^| than 2 J A ralo s ri ), which follows 
from the fact that the vertex set has this many different permutations. 

A drawing of a graph is a mapping / which assigns to each vertex of G a 
distinct point in the plane and to each edge uv a continuous arc between f{u) 
and f{v), not passing through the image of any other vertex. For simplicity, the 
point assigned to a vertex is also called a vertex and an arc assigned to an edge 
is also called an edge of the drawing, and, if this leads to no confusion, it is also 
denoted by uv. We assume that (a) two edges have only finitely many points in 
common, and (b) if two edges share an interior point p , then they properly cross 
at p. Two drawings of G are said to be essentially equivalent the set of crossing 
pairs of edges is the same in the two drawings. Otherwise, they are essentially 
different. 

Let A{n) and A{n) denote the number of essentially different drawings and 
essentially different straight-line drawings, resp., of the complete graph K n with 
n vertices. For any d > 0, let Ad(n) denote the number of drawings with the 
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property that any two edges have at most d points in common. Clearly, we have 
A(n) < Ai(n) < ^(n) < Az(n) < . . . < A(n), 



for every n. 

In Sections 2 and 3, we review the extremal graph theoretic tools used in this 
paper and establish Theorem 1, respectively. In Section 4 we prove Theorem 2 
in the special case d = 1. The proof in the general case is based on the same 
ideas, but it is technically more complicated, and it is omitted in this extended 
abstract. In Section 5, we deduce the following estimates. 

Theorem 3. For the number of essentially different drawings of K n under var- 
ious restrictions, we have 

(i) 2 n ( nl °s n ) < A(n) < 2 0 ( nl °g"); 

(ii) 2°^ < Ai(n) < 2°(" 2lo s n ); 

(in) 2 fi (™ 2lo « n ) < Ad(n) < 2°^ ni \ for any fixed d > 2; 

(iv) 2°^ < A(n) < 2 0{ - ni \ 

2 Tools from Extremal Graph Theory 

One of the central questions in extremal graph theory [B78] is the following. 
Given a graph H , what is the maximum number of edges that a graph of n ver- 
tices can have if it does not contain Ft as a (not necessarily induced) subgraph? 
This quantity is usually denoted by ex(n, H). 

Obviously, the property that a graph is H- free, is hereditary. Let Forb(n, H ) 
denote the speed of this property, i.e., the number of graphs on n labeled vertices 
that do not contain H as a subgraph. It turns out that the growth rate of these 
functions crucially depends on the chromatic number y(-ff) of H. 

Theorem 2.1. (Erdos-Stone [ES46], Erdos-Simonovits [ES66]) For any graph 
FI , we have 

Theorem 2.2. (Erdos-Frankl-Rodl [EFR86]) For any graph H, we have 

Forb(n, Ft) = 2 ( 1 +°( 1 ))ex(n,^ 

If we want to establish analogous results for graphs containing no induced 
subgraph isomorphic to Ft, then the first difficulty we have to face is the follow- 
ing: unless Ft is a complete graph, the maximum number of edges that a graph 
of n vertices can have without containing an induced copy of Ft is (!() ■ Thus, 
Theorem 2.1 does not have a direct analogue. Nevertheless, set 

ex*(n,ff) := (l - y + o(n% 
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where the relevant quantity, t{H), taking the place of the chromatic number is 
defined as follows. 

We say that H is (r, s)-colorable for some 0 < s < r if there is an r-coloring 
of the vertex set V(H), in which the first s color classes are cliques (i.e. , induce 
complete subgraphs) and the remaining r — s color classes are independent sets 
(i.e., induce empty subgraphs). Let C(r, s ) denote the class of all (r, s)-colorable 
graphs, i.e., 

C(r,s ) = {H : H is (r, s)-colorable} . 

Let t(H) be the minimum integer r such that H is (r, s)-colorable for all 0 < 
s <r. Clearly, we have t(H) > x(H), for every H. 

Let Forb*(n, H) stand for the number of graphs on n labeled vertices which 
does contain H as an induced subgraph. 

Theorem 2.3. (Promel-Steger [PS92]) For any graph H, we have 
Forb *(n,H) = 2 ( 1 +°( 1 ))e** (",»). 

Using Szemeredi’s Regularity Lemma, Bollobas and Thomason [BT97] gen- 
eralized this result to any nonempty hereditary graph property V. Define the 
coloring number r{V) of V as the largest integer r for which there is an s such 
that all (r, s)-colorable graphs have property V. That is, 

r(V) = max{r : there exists 0 < s < r such that V D C(r, s)}. 

Consequently, for any 0 < s < r( V) + 1, there exists an (r( V) + 1, s)-colorable 
graph that does not have property V . 

In the special case when V is the property that the graph does not contain 
any induced subgraph isomorphic to H, we have r(V) = t(H) — 1. 

Theorem 2.4. (Bollobas-Thomason [BT97]) Let V be a nontrivial hereditary 
property of graphs, and let V n denote the set of all graphs in V on the vertex set 
{1, 2, . . . n}. Then the speed of property V satisfies 

\'P n \ = 2( 1 -^+°( 1 ))(S) ; 
where r(fP) is the coloring number ofP. 



3 String Graphs Proof of Theorem 1 

We start with the lower bound. Consider four pairwise tangent non-overlapping 
disks Di, 1 < i < 4, in the plane (see Fig. 1). Assume for simplicity that n is 
divisible by 4. The proof for other values of n is analogous. Replace the boundary 
of each Di by n / 4 slightly smaller concentric circles CA, 1 < k < n/ 4, running 
very close to it. Fix a pair ( i,j ), 1 < * < j < 4. By local deformation of every 
Cik in a small neighborhood of the point of tangency of Di and Dj , we can 
achieve that every Cik has a point lying outside every other Cih, h ^ k. For 
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every 1 < l < n/4 and for any predetermined set of indices Ki C {l,2,...,n/4}, 
we can now slightly modify Cji so that it would intersect a curve C^ if and 
only if k G /\;. In other words, we can arbitrarily specify the bipartite crossing 
pattern between the curves Cik and Cji, 1 < k, l < n/4. Repeating the same 
procedure for every pair (i. j), we can obtain any 4-partite crossing pattern 
between the 4 classes, each containing n/4 curves. Note that every Cik is a 
closed curve, but deleting any point of it which does not belong to another curve 
it becomes a string. Thus, the number of essentially different string graphs is at 

i 6n^ 3 

least 2 tb“ > 2 4 w. 




Fig. 1 . Lower bound construction for the number of string graphs. 



Next, we establish the upper bound. For any r > 2, let G r be a graph with 
vertex set 

V(G r ) = {vij : 1 <i,j <r} 

and edge set 

E(G r ) = {vijVik : l<i,j,k<r, j ^ k} , 

where = Vji , for every i and j. In other words, the vertices of G r repre- 
sent the vertices and the edges of the complete graph K r , two vertices of G r 
being connected if the corresponding two edges of K r share an endpoint or the 
corresponding edge and vertex of K r are incident. 

Lemma 3.1. We have r(G r ) = r. 

Proof. The vertices 1^,1 < j < r form a clique of size r. Therefore, we have 
r(G r ) > x(G r ) > r. 

Now we show by induction on r that r(G r ) = r. This is true for r = 2. Let 
r > 2 be fixed and assume r(G r _ i) = r — 1. We have to show that, for any 
0 < s < r, the vertices of G r can be colored by r colors so that s color classes 
induce cliques and the remaining r — s color classes are independent sets. 
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For s = 0, the following coloring will satisfy the requirements. For any 1 < 
k < r, color a vertex v^j with color k if and only if i + j = k mod r. Clearly, each 
vertex of G r receives a color and each color class is an independent set. 

If s > 0, color each vertex of the clique {vij : 1 < j < r} with color 1. The 
uncolored vertices induce a subgraph isomorphic to G r - 1 , for which we have 
r(G r - 1 ) = r — 1, by the induction hypothesis. So the remaining vertices can be 
colored by r — 1 colors so that s — 1 color classes induce cliques and the other 
r — s are independent sets. 

Lemma 3.2. G 5 is not a string graph. Proof. Suppose that G 5 has a string 

representation. Continuously contract each of string (arc) representing va (1 < 
i < 5) to a point pi : without changing the crossing pattern. For every pair i ^ j , 
consider the portion of the arc representing between the points pi and pj. 
These arcs define a drawing of it's, in which no two independent edges cross each 
other. However, A '5 is not a planar graph, hence, by a well known theorem of 
Hanani and Tutte [Ch34], [T70], no such drawing exists. 

Now we can complete the proof of Theorem 1. By Lemma 3.2, a string graph 
cannot have an induced subgraph isomorphic to G5. Thus, in view of Lemma 
3.1, Theorem 1 directly follows from Theorem 2.3: 

\S n \ < Forb* n (G 5 ) = 2(j +o(1) )(S). 



4 String Graphs of a Fixed Rank — Proof of Theorem 2 

In order to show that there are 2°(" 2 ) string graphs of rank d , in view of Theorem 
2.4, it is enough to exhibit a (2, 0)-colorable, a (2, l)-colorable, and a (2,2)- 
colorable graph such that none of them is a string graph of rank d. 

Here we present the argument only in the special case d = 1. 

Let H 3 3 denote a graph with vertices Ui , Vj, and Wij, 1 < i,j < 3 and 
edges UiWij,WijVj , for every i and j. In other words, H3 3 is the graph obtained 
from ^ 3 , 3 , the complete bipartite graph with three vertices in its classes, by 
subdividing each of its edges by an extra vertex. 

For any k , let denote a graph with vertices Uj, (1 < i < k) and uj , for 
every I C (1,2,..., k}. Let Vi and Vj be connected by an edge of Tfc, for any 
1 < * < j < k, and let Vi be connected to ui if and only if iff. Let Tj. denote 
the graph obtained from Tfc by adding the edges ujuj, for every I ^ J. 

Clearly, iL 3j3 is (2, 0)-colorable (bipartite), Tj. is (2, l)-colorable, and T^ is 
(2, 2)-colorable, for every k. Therefore, if V = V(H^^,Tk,T' k ) denotes the prop- 
erty that a graph does not contain # 3 , 3 , Tfc, or T k as an induced subgraph, then 
V is a hereditary property with coloring number r(V) = 00 . Hence, by Theorem 
2.4, for the number of graphs on n labeled vertices, satisfying property T, we 
have \V n \ = 2 °(" 2 ). 

It remains to prove the following statement, which implies that C V n if 
k is large enough. 
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Lemma 4.1. A string graph of order 1 cannot contain H 33 , Tf., or T' k as an 
induced subgraph, provided that k is sufficiently large. 

Proof. It is well known that a string graph cannot contain H 3 3 as an induced 
subgraph (see e.g. [EET76], 

Using the notation in the definition of T k (and Tjf), let Vi, 1 < i < k and uj, 
I C {1,2,..., /c} stand for the vertices of T*, (and Tf, resp.), and suppose that 
Tfc (and Tf, resp.) has a string representation in which any two strings cross at 
most once. For simplicity, we use the same notation for the strings as for the 
corresponding vertices. 

Fix arbitrarily an orientation of each string. For any triple (x, y, z), 1 < x < 
y < z < k, let f xyz = 1 if along v y the crossing with v x follows the crossing with 
v z . Otherwise, set f xyz = 0. 

By Ramsey’s theorem, there exists a “homogeneous” subset J C {1, 2, . . . , k}, 
\J\ > log log k, such that f xyz is constant over all triples ( x,y,z ), 1 < x < 
y < z < k, x,y,z £ J . We can assume without loss of generality that J = 
{1,2,..., m}, where to > log log k. 

For any 1 < i < m, the string Vi crosses all other Vj, 1 < j < m, i yf j 
exactly once. Since f x { z is constant over all triples (x,i,z), 1 <x<i<z<k, 
one can find a non-crossing point on Vi that divides Vi into two parts, vf and 
vf, containing all crossings between Vi and v x with x < i and between Vi and v z 
with z > i, respectively. The arcs vf and vf are called the lower part and the 
upper part of v, , respectively. 

Construct two 42-uniform hypergraphs, H < and H > , both on the vertex set 
{1,2,..., to}, as follows. For any 1 < X\ < X 2 < • • • < x S3 < to, there exists 
a string u = 3 } that crosses v Xl , v X2 , . . . , v XS3 , but no other Vj. The 

string u crosses either the lower or the upper part of each v Xi , so for at least 42 
indices 1 < i < 83 it will cross, say, the lower (resp., upper) part. Suppose, for 
example, that u crosses the lower (resp., upper) parts of v Xl ,v X2 , . . . ,v X42 . Then 
add the hyperedge {xi,X 2 , ■ ■ ■ , X 42 } to H < (resp., to H > ). 

Repeating the above procedure for every 83-tuple 1 < X\ < X 2 < • • • < 

< to, the total number of hyperedges in H < and H > with repetitions is (^) . 
However, the multiplicity of each hyperedge is at most ( m 4 " 1 42 ) ■ Thus, the total 
number of distinct hyperedges in H < and H > is C(to 42 ) (i.e., at least constant 
times to 42 ). Suppose without loss of generality that H < has Q (to 42 ) distinct 
hyperedges. 

We can now apply a well known result of Erdos [E65] (see also [B78] and 
[PA95], p. 151) to conclude that, for any fixed l and sufficiently large to, our hy- 
pergraph H < contains a complete 42-partite, 42-uniform subhypergraph Kf 2 t 
with l elements in each of its classes. (That is, Kf 2 ( has 42 1 vertices, divided 
into 42 classes of size l, and it consists of all 42-tuples that contain one vertex 
from each class.) 

For simplicity, denote by sj, 1 < i < 42, 1 < j < l the lower parts v x of 
the strings v Xk corresponding to the vertices of Kf 2 ; . By the construction, for 
each 42-tuple (j 1 , . . . ,^ 42 ), 1 < j \, . . . , J 42 < l, there exists a string u 7l) ;742 that 
crosses s} 1 , . . . , S 42 2 , but no other string sj . 
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Fig. 2. Some of the strings representing a A' 42 .., 3- 



Color the 42-tuples (j \ ... . . J 42 ) with 42! colors, according to order in which 
the crossings with sj 1 , . . . , s 4 “ occur along ttj 1 ,...j 42 - Thus, we can find at least 
L?(^ 42 ) 42-tuples of the same color (say, white). Suppose without loss of generality 
that, for each such 42-tuple (ji , . . . ,. 742 ), the string '«j^..., 742 first crosses sj 1 , then 
Sg 2 ,..., and finally Applying Erdos’s result again, if l is sufficiently large, we 
can find a subhypergraph K ^ 2 3 C Kf 2 ; , all of whose 42-tuples are white. 
Again, we can assume without loss of generality that the strings corresponding 
to the vertices of K | 2 3 are sj, 1 < i < 42, 1 < j < 3. Recall that each s\ is 
the lower part of an original string v x , therefore, no two sj can cross each other. 
(Indeed, the intersection of v x and v y , x < y, must belong to the upper part of 
v x and at to the lower part of v y .) 

Summarizing: we have 3 • 42 = 126 strings sj, 1 <i< 42, 1 < j < 3, no two 
of which intersect. Moreover, for each 42-tuple (ji , . . . , 742 ), 1 < ji, ■ ■ ■ , J 42 < 3, 
there is a string Wj 1 ,... j42 that intersects the strings sj 1 , . . . , S 42 2 in this order, 
and does not intersect any other sj. (See Fig. 2.) We would like to show that 
there are two different strings of the type u :/1 .... ;j42 that cross more than once. 
First, we give a lower bound for the number of crossings CR(u, u ) between strings 
of type u n ,.... Ji2 - 

Let 1 < x < 41 be fixed. For any pair y, z, 1 < y, z < 3, consider all strings 
An, ■■■042 w hh jx = V an( i jx+i = z , and let r VtZ denote the set of their portions 
between their intersections with s v x and s x+1 . Clearly, we have \r y>z \ = 3 40 . 
Pick one element from each F y z , 1 < y, z < 3, and notice that at least one 
pair among these 9 arcs must be crossing, otherwise, together with the strings 
s 4 , s 2 , s^, s 4 4 , s 2 +1 , s^+iJ they would give a string representation of H 3 3 , which 
is impossible (see the first paragraph of this proof). Thus, for a fixed x , the total 
number of crossings between the elements of F y ^ z and F y i z i over all y,z : y',z', 
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1 <y,z, y', z' < 3, (y, z) ± (y', z') is at least 

ni<„„<3 \FyA _ 3 9 ' 40 _ 80 

37-40 — 37-40 — 6 ‘ 

Here the denominator, 3 7 ' 40 , is the number of 9-tuples of arcs, one from each 
set r y>z , 1 < y, z < 3, in which a crossing pair of arcs is fixed. Repeating this 
count for every x, 1 < x < 41 and noticing that every time we count different 
crossings, we obtain that 

CR (u,u) > 41 • 3 80 . 

On the other hand, the number of strings of type u Jt ,.... )42 is 3 42 . If any two 
of them cross at most once, than CR(u, u) < 3 84 /2, which is a contradicts the 
above inequality. This completes the proof of the lemma. 



5 Drawings of Complete Graphs Proof of Theorem 3 

(i) It is easy to see that the order type on the vertices of K n (i.e., the orientation 
of its triples) determines the set of crossing pairs of edges, So the upper bound 
follows from a result of Goodman and Pollack [GP86] , that there are at most n 6n 
different order types on n points. On the other hand, we can place the vertices 
of K n on a circle, in (n — 1)! different cyclic order, and each placement gives a 
different list of crossing pairs of edges. It is also easy to come up with a list of 
nP (") drawings such that by relabelling the vertices of any one of them, we do 
not obtain a drawing essentially the same as another. 

(ii) Suppose n is divisible by 4, and let Uj = (— 1 ,i),Uj = (1 ,j), and Wk = 
(0,fc/2), for any 1 < i. j < n/4 and 1 < k < n/ 2. For every 1 < k < n/2, 
connect and Wk+i by a straight-line segment. Furthermore, connect every 
Vi to every Uj by a line segment so that each such segment passes through 
some point Wk- By slightly bending each edge ViUj , but keeping its endpoints 
fixed, we can achieve that it passes either slightly above or slightly below u>i + j. 
At each edge v^Uj , we have two choices, so there are 2” 2 / 16 possibilities. In 
each drawing, any two edges cross at most once, and different choices give rise 
to different crossing patterns. (Indeed, ViUj passes above Wi+j if and only if 
it crosses the edge Wi+jWi+j+i.) Finally, one can slightly perturb the vertices 
so that no three of them would be collinear, and connect the missing pairs by 
straight-line segments without creating more than one crossing between any pair 
of edges. Therefore, the number of different crossing patterns is at least 2 ra Z 16 . 



As for the upper bound, for a fixed drawing, for each vertex v t , list the edges 
incident to u, in clockwise order around Vi. For every vertex, we have (n — 2)! 
possibilities, so there are ((n — 2)\) n < 2 n log " different sets of lists. We claim 
that this set of lists uniquely determines the crossing pattern. To see this take two 
edges, V 1 V 2 and V 3 V 4 , and consider the drawing of K 4 induced by these vertices, 
as a drawing on the sphere. Two spherical drawings of K 4 are combinatorially 
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Fig. 3. The eight combinatorially different drawings of K 4 . 



equivalent if the corresponding maps are isomorphic. There are 8 combinatorially 
different drawings of K 4 , with the property that any two edges have at most one 
point in common (see Fig. 3), and these drawings can be distinguished by looking 
at the cyclic orders of edges incident to a vertex. Hence, the cyclic order of edges 
at the vertices determines whether V 1 V 2 and U3V4 cross each other. 

(iii) Suppose n is divisible by 3. For i= 1,2,..., n/3, let Vi = (—1, i),Wi = (0, i), 
and Ui = (1,*). Connect every to every Uj , as follows. Choose a number k , 
0 < k < n/3, and connect both and Uj to (0, k + s) by a segment. Also connect 
any two consecutive w/ s by a segment. In the resulting drawing, any two have at 
most two common points, and a different choice for any ViUj results a different 
crossing pattern. Therefore, the number of different crossing patterns is n/3 ra / 9 . 
Clearly, each of these drawings can be extended to a drawing of the complete 
graph such that still any two have at most two common points. For instance, 
slightly perturb the points together with the existing edges, so that the points 
are in general position, and add the missing edges as segments. 

For the upper bound, apply Theorem 2 for the edges of K n regarded as (£) 
strings. 

(iv) Suppose n is even, and let Vi = (—1,?), u, = (1 , z) , for 1 < i < n/2. For 
any i,j, 1 < i < j < n/2, connect Vi with (0,m + j) and connect (0 ,ni + j) 
with Vj. Now, all vertices Vt and all edges connecting them are on the left side 
of the line x = 0 such that each of the edges has exactly one point on that line, 
and all these points are different. On the other hand, all vertices Ui, are on the 
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right-hand side of the line x = 0. So, for any p, q, 1 < p < q < n/2 , and for 
any set K pq C {(*, j) : 1 < i < j < n/2}, we can draw the edge v p v q so that it 
crosses UiUj (i < j) if and only if (i,j) £ K pq (cf. proof of Theorem 1). 
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Abstract. Intersection graphs of convex polygons inscribed to a circle, so called 
polygon-circle graphs, generalize several well studied classes of graphs, e.g., in- 
terval graphs, circle graphs, circular-arc graphs and chordal graphs. We consider 
the question how complicated need to be the polygons in a polygon-circle repre- 
sentation of a graph. 

Let cmp(n) denote the minimum k such that every polygon-circle graph on n 
vertices is the intersection graph of fc-gons inscribed to the circle. We prove that 
cmp(n) = n — log 2 n + o(log 2 n) by showing that for every positive constant 
c < 1, cmp(n) < n — clogn for every sufficiently large n, and by providing 
an explicit construction of polygon-circle graphs on n vertices which are not 
representable by polygons with less than n — log n — 2 log log n comers. We also 
show that recognizing intersection graphs of fc-gons inscribed in a circle is an 
NP-complete problem for every fixed k > 3. 



1 Introduction 

Intersection graphs of geometric objects, namely in the plane, are intensively studied 
both for their practical motivations and for interesting structural and algorithmic proper- 
ties. Many hard (NP-complete in general) optimization problems become polynomially 
solvable when restricted to various classes of intersection graphs. Probably the oldest 
and simplest of these are interval graphs, intersection graphs of intervals on a line [4], 
whose structure is well understood, they are recognizable in linear time, and for which 
problems like clique, independent set, dominating set, chromatic number and many more 
are tractable. On the other end of the spectrum are string graphs, intersection graphs of 
arc-connected sets in the plane, which are hard to recognize [8], and whose recognition 
was only recently shown decidable [12,15] and then even more surprisingly in NP [16]. 

Geometric intersection graphs also provide a special kind of graph visualization. 
Vertices are represented as geometric objects and adjacencies are visualized by nonempty 
intersections. Overlapping regions are illustrative and enhance the visual understanding 
of the represented graph. String graphs are closely related to another graph drawing 
invariant, the crossing number , i.e., the minimum number of edge crossings in a drawing 
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of a graph. Abstract topological graphs (cf. [10]) are graphs with specified pairs of edges 
which are allowed to cross in a feasible drawing. In this setting it may happen that 
multiple edge crossings occur in every feasible drawing, and an abstract topological 
graph may actually require exponential number of crossing points [11]. The crossing 
number of abstract topological graphs is closely related to the number of crossing points 
in representations of string graphs as intersection graphs of curves, and in fact in this 
way the decidability and NP-membership of string graphs were tackled. 

Several well studied classes of graphs which directly generalize interval graphs in- 
volve geometric objects bound to a circle. Among these are circle graphs, intersection 
graphs of chords of a circle, and circular arc graphs, intersection graphs of intervals 
on the circle. Both these classes are recognizable in polynomial time (cf. [1,3,13] for 
circle graphs, [17,2] for circular arc graphs). A common generalization of these two 
are polygon-circle graphs, intersection graphs of convex polygons inscribed to the cir- 
cle. This class was first suggested by M. Fellows [personal communication with the 
first author] in 1988, when it was pointed out that this class of graphs is closed un- 
der taking induced minors. Under a different name of spider graphs, polygon-circle 
graphs appeared in [7], where a polynomial time recognition algorithm was announced. 
Somewhat surprisingly, the algorithm was never published, and this fact creates certain 
doubts about its correctness. A structural property of graph classes which many inter- 
section graphs possess is near-perfectness in the sense of Gyarfas [5]. A graph class is 
near-perfect if the chromatic number of each of its graphs is bounded by a function of 
the clique number of the graph. (For perfect graphs this function is identity, and from 
here the notion originates.) Polygon-circle graphs are near-perfect as shown in [9], and 
the bound presented therein improved the up to that date best bound for circle graphs 
and has not been improved since. 

In the current paper we pay closer attention to the question of how complicated should 
be the polygons representing the vertices of a polygon-circle graph with n vertices. One 
can easily see that n-gons always suffice (hence no exponential surprises as in the case 
of string graphs), which means that polygon-circle graph recognition is definitely in NP. 
It is conceivable, however, that polygons with less corners would suffice. To be able to 
precisely formulate this question, we define the complicacy of a graph G as the minimum 
k such that G is the intersection graph of convex fc-gons inscribed to a circle, and we 
denote this invariant by cmp(G) (we set cmp(G) = oo if G is not a polygon-circle 
graph). We further define cmp(n) to be the maximum of cmp(G) over all polygon- 
circle graphs with n vertices. The main result in this direction is the following (here and 
throughout the paper, all logarithms are base 2): 

Theorem 1. We have cmp(n) = n — log n + o(log n). 

The lower and upper bounds are proved separately in the next two sections. In the last 
section we consider the computational complexity of determining the complicacy of a 
graph with the following result: 

Theorem 2. For every fixed finite k > 3, it is NP-complete to decide whether cmp(G) < 
k holds for an input graph G. 

This result, which answers an open problem listed at J. Spinrad’s web page [14], is 
not in contradiction with the announced algorithm of Koebe for recognition of polygon- 
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circle graphs (i.e., deciding cmp(G) < oo), but it definitely sheds new light on the 
problem of recognizing polygon-circle graphs. Note also that for k = 2, cmp(G) < 2 
if and only if G is a circle graph, a polynomially decidable question. 

2 Technical Notions and Observations 

Throughout the paper we use small letters as a, b, u, v, . . . for vertices of the graph 
under consideration, and if R is a representation by polygons, R v denotes the polygon 
representing vertex v. However, in figures, to avoid multiple subscripts, we will usually 
omit the symbol R. We assume that the bounding circle is fixed and whenever we speak 
about polygons, we automatically assume that the polygons are convex and have all 
corners placed on the circle. 

If P is a polygon, then the connected parts obtained from the bounding circle by 
deleting the corners of P are referred to as the P-segments. If two polygons represent 
nonadjacent vertices, they must be disjoint and hence all corners of one of them lie 
within the same segment determined by the other one, and vice versa. In the following 
technical definition we assume that all polygons under consideration are disjoint. 

Definition 1. We say that polygon P blocks polygon Q from polygon S if the corners of 
Q lie in a different P -segment than the corners of S. If a set S of polygons is such that 
none of them blocks any other two polygons from each other, we say that the polygons 
are positioned around the circle. 

See Figure 1 for an illustrative example of blocking polygons, and a set of polygons 
positioned around the circle. Next we make the first simple but useful observation. 




Fig. 1. In the left, polygon P blocks polygon Q from polygons S and T, in the right, all four 
polygons are positioned around the circle. 



Proposition 1. In any representation R of the cycle C^k with 2 k vertices u-\ , it 2, ■ ■ ■ ■ «2fc, 
the polygons R U2i = are positioned around the circle. If \V' k is the graph 

obtained from C2k by adding a vertex v adjacent to U2i, i = 1, 2, . . . , k ( i.e., W' k is 
the wheel Wk with each rim edge subdivided), then R v has at least k corners and 
cmp(VFj') > k (in fact, the complicacy ofW k equals k). 
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Fig. 2. An example of the lower bound construction for 1=1 and n = 11. We write simply P 13 
f° r P{i, 3 } etc. 

Proof. For any three (even indexed) vertices U 2 i,U 2 j,U 2 h of C'-iA, there exists a path 
connecting U 2 j and 11 , 2 k which does not contain any neighbor of 112 , ■ The union of the 
polygons representing vertices of this path is a connected subset of the disk bounded by 
the base circle, and hence (by the Jorda curve theorem) R U2j must not be blocked from 

h by R-U2i • 

For the wheel graph W' k , it follows that R v must have at least k corners, to intersect 

all R U2i G = 1,2 ,k. 

3 Complicacy of Representations - The Lower Bound 

Theorem 3. For n large enough, we have cmp(n) > n — log n — 2 log log n. 

Proof. The proof is by constructing graphs of large complicacy by an explicit construc- 
tion. Suppose n is large enough (how large will follow from the calculations in the proof). 
Let l be the uniquely defined integer such that 



We construct the graph G = ( V , E) with vertex set 

V = {v}ULL)P 1 

where vertex v is adjacent to all other vertices, L is a clique of size 2/: + 2, and P is 
an independent set whose vertices are indexed by distinct (£ + l)-element subsets of L. 
These indices will determine the adjacencies between vertices of P and L as follows. If 
p a 6 P (with a C L, |a| = l + 1), we make p a adjacent to all x € a and to no other 
vertices of L. (Note that G is not determined uniquely, it depends on the choice of the 
sets used for indexing the vertices from P. Any choice of distinct indices works.) 
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We claim that G is a polygon-circle graph and that cmp(G) > |P|. For the first part 
of the claim, choose a point O on the circle and position the polygons corresponding 
to the vertices of P around the circle (so that they do not block O from one another). 
Choose one corner of each of them as its reference point and represent each vertex x of L 
by the convex hull of the reference points of the vertices of P adjacent to x and of O. By 
making O a corner of each R x we guarantee that every two polygons R u , R w ,u , w £ L 
intersect ( L is a clique). Finally R v will be the convex hull of O and the reference points 
of all polygons representing vertices of P. (Note that the auxiliary point O may not be 
necessary, e.g., in the case when for every two vertices in L, P contains a vertex adjacent 
to both of them.) 

To argue that cmp(G) is large, consider an optimal representation R of G. Note first 
that for n > 5 we have £ >1 and hence ( 2 /) > 2- This means that n > 2£ + 3 and 
indeed G contains all vertices of L. 

The key observation is that the polygons R pi p £ P must be positioned around 
the circle. For suppose this is not the case, say R Pb blocks R Pa from R Pc for some 
a,b,c C L. Since these subsets are different but of equal size, there must be an a; £ a \ b 
and ay £ c\b. By the definition of G, R x intersects R a and R y intersects R c , but 
none of R x , R y intersects IR. But that means that It, blocks R x from R y and these two 
polygons cannot intersect each other (though x and y belong to the clique IS). 

To intersect all polygons representing vertices of P, R v must have at least | P\ corners, 
and hence cmp(G) > |P| = n — 2t — 3. The rest is a simple calculation. 

Assume for contradiction that £ > + log log n — § . Then 

/2A 2 2e n log 2 n 

n> \£J > 2£+l > 8 (log n + 2 log log n — 2) > ? * 

for large enough n (since lim^^oo gpog^fog^ogn^) = °o), a contradiction. Therefore 
(for every large enough n), £ < lo " + log log n — | and 

cmp(G) > \P\=n — 2£ — 3 > n — log n — 2 log log n 



as claimed. 

4 Complicacy of Representations - The Upper Bound 

Theorem 4. For every positive constant c < 1, there exists an no such that cmp(n) < 
n — clog n for every n > no- 

Proof. Let G = (V, E ) be a graph on n vertices and let cmp(G) = k > 4. Consider a 
polygon-circle representation R of G such that no two polygons share a corner and such 
that every polygon has at most k corners (this can be always achieved by splitting the 
comers). Let our representation have the minimum total number of corners among all 
such representations. Choose a vertex v £ V such that R v has k corners, and denote its 
comers v 1 , v 2 , . . . , v k as they appear clockwise around the circle. 

Based on this representation R, define A to be the set of vertices x such that R x has 
all corners within two consecutive /(^-segments, i.e., such that It, intersects R r only in 



64 



J. Kratochvfl and M. Pergel 




Fig. 3. Illustration to the choice of representatives of index classes. 



the triangle v l ~ 1 v l v' l+1 , for some i. For x € A, denote thiszby j(a:) and call it the index 
of x. We can see that for every i, there exists an x € A of index i. For if such a vertex 
did not exist for some i, we could reduce the number of comers in the representation by 
deleting the triangle v‘ l ~ 1 v l v' l+1 from R v , contradicting the choice of R. 

Now assume that R' minimizes "}Z x( z A j (x) among all representations with the same 
central polygon R v , with the same total number of corners and with the same set A (when 
defined as above) as R. For the sake of simplicity we call R' again just It. 

For every i, choose one vertex as a representative of the vertices of index i, and 
call it a,i. As argued above, a, is well defined for every i. The intersection graph of 
the polygons R ai ,i = 1,2, . . . , k is either a cycle or a disjoint union of paths, since 
a, can only intersect Oj_i and/or Oj+i- We denote A 1 the set of those that have 
no neighbors among the other ah’s, and A 2 = {ai, ct 2 , . . . , dfc} \ A\. We then denote 
B = (V\ {n}) \ (A\ U A 2 ) and B\ the set those vertices of B which are adjacent to at 
least one vertex of A\. 

Next we claim that every two vertices of A\ have different sets of neighbors. For 
suppose that {x\xai € E} = {x\xaj € E} for some j < i. Then the polygons 
representing the neighbors of dj intersect one side of R a . , and hence both R a . and R, l;i 
could be replaced by two parallel chords (digons) placed close enough to this side. This 
would result in a new representation of the same graph, with the same position of R v 
and the same set A, but with a strictly smaller sum of the indices of the vertices of A, 
contradicting the choice of R. See an illustrative example in Figure 4. 

The immediate but very important consequence is that 



I -Ail < 
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Fig. 4. Illustration to reducing the sum of indices of the A vertices. 



since v is a common neighbor of all vertices in A j , and on the other hand no vertex of 
Ai is adjacent to any other vertex of A\ IJ /i-j. 




Fig. 5. Illustration to the construction of B 2 , the blocking set for A 2 - 
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Finally we attend to vertices of A 2 . Consider the example in Figure 5. If R ai and 
R a2 intersect, they must intersect in the area bounded by the v 1 v 2 side of R v and the 
corresponding /^-segment. Choose a point v 12 in this segment between the leftmost 
corner of a 2 and the rightmost corner of ai. If we now replaced the central polygon R v 
by the convex hull of i> 12 , i> 3 , ...,v k (i.e., the corners v 1 andc 2 are replaced by i> 12 ), the 
new R v (denoted by dashed lines in the figure) would still intersect both R ai and R a2 , 
but the total number of corners would decrease. Since this is impossible by the choice 
of R, the new collection of polygons does not represent G anymore. There are only two 
possible reasons. Either the new R v intersects a polygon IR which is not supposed to 
be intersected (as in Figure 5 bottom left), or the new R v loses intersection with some 
Rb that it previously crossed (as in Figure 5 bottom right). In the former case, Ri, would 
lie fully within the segment v 1 v 2 , in the latter one, b £ A and its index is j(b) = 1 (or, 
symmetrically, j(b) = 2). In any case, we choose one such b and call it a witness for 
(1, 2). Similarly, we choose a witness for every (*, i+ 1) such that R a . nR ai+1 ^ 0, and 
denote by B 2 the set of witnesses constructed in this way. By the construction, /i 2 C B. 
It may happen, though, that one vertex of B 2 is a witness for two pairs (a witness of 
the latter type and index i could have been chosen both for the pair (i — 1, i) and for 
( i , i + 1)). Given the fact that a path of t consecutive vertices from A 2 would involve 
t — 1 pairs, such a path would give rise to at least distinct witnesses, and we obtain 
the following inequality: 

|A 2 | <3|B 2 |. 

(Note here, that B 1 and B 2 are not necessarily disjoint.) 

Now we summarize 

k = cmp(G) = | Ai| + |A 2 | < 2^ + 3|B 2 | < 2 n ~ k + 3 (n - k ) 

(the last inequality follows from the fact that A\ U A 2 U i?i C V and Ai U A 2 U B 2 C V, 
while in each case the three summands are pairwise disjoint), and hence 

2 n > 2 k (4k — 3n). 

For k > n — clog n (0 < c < 1), we would get 

2 n > 2 n - clogn (4(n - c log n) - 3n) = 2 " n ~ 4clogn > 2 n 

n c 

for every n > no for some no, as linin^oo \ = 00 and limn^oo 4c log n = 0. Which is 
the final long worked for contradiction. 

The proof of Theorem 1 is now at hand. Consider the difference /(n) = cmp(n) — 
n + logn. On one hand, Theorem 3 states that f(n) > —2 log log n = o(logn). On 
the other hand, Thereom 4 yields that for every e > 0, f(n) < elogn for every large 
enough n (by setting c = 1 — e), and hence f(n) = o(logn). 
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5 Computational Complexity 

We first restate Thereom 2 in a slightly stronger form: 

Theorem 5. For every fixed k > 3, it is NP-complete to decide //cvnp(G') < k for an 
input graph G, even if G is promised to have complicacy at most k + 1. 



Proof. We reduce from the following hypergraph coloring problem: Given a set T of 
triples over a base set X, decide if the elements of X can be colored by three colors 
so that every triple T £ T receives all three colors. This problem is NP-complete as it 
contains the 3-edge colorability of cubic graphs as a subproblem, the latter being NP- 
complete by Holyer [6], (By resctricting to 3-edge colorability of bridgeless graphs, we 
may assume that X is colorable by 2 colors so that no triple T £ T is monochromatic.) 

Given such a T, we construct a graph Gj- = ( V. E ) as follows. The vertex set will 
consist of a cycle of length 2k on vertices iti, u 2 , ■ ■ ■ , u 2k , a pair of vertices a x , b x for 
every x £ X , and a vertex ct for each triple TeT. The c vertices form a clique and 
each of them is adjacent to all u vertices with even subscripts, to all b vertices, and to 
those a vertices which correspond to x’s belonging to the particular triple. The b vertices 
are further adjacent to their corresponding a vertices and to 112 , 1 / 4 , u <:, ■ Formally 

V = {ui\i = 1, 2, . . . , 2k} U {a x , b x \x £ X} U {ct|T £ T} 

E = {uiu 2 , u 2 u 3 , . . . , u 2k ui } U {a x b x \x £ X} U {b x Ui\x £ X, i = 2, 4, 6} 
l){b x b v \x,y £ X} U { c T a x \x £ T £ T} U { c T b x \T £ T, x £ X} 
U{ctCs|T, S' £ T} U {c T Wi|T £ T, i = 2,4, 6, ... , 2k}. 

We claim that Gj- is always a PC graph of complicacy at most k + 1, and its 
complicacy is (at most) k if and only if T allows a coloring as desired. 

Let R be a PC representation of Gp. We first note that all polygons R ax , x £ X 
must be positioned around the circle. For if one of them would block another two from 
each other, some R x would block some R y from R U2 and the auxiliary polygon If 
would not be able to intersect both R y and Ii„ 2 . 

As observed in Proposition 1, the polygons -R U2i ,i = 1,2, . . . ,k are positioned 
around the circle, and only these vertices of the cycle are intersected by R CT ’s. Since 
the polygons representing the b’s intersect (only) R U2 ,R Ui and R Vf . , all the polygons 
representing the a vertices must lie in the segments determined by R U2 , R Ui and R Ufj . 
So color an element x £ X by color i if R ax is blocked by R Ui from the other R Uj , 
i,j = 2,4, 6. We claim that this is a good coloring. For if a triple T £ T contained two 
vertices of the same color, say x and y of color 2, the polygon R Ct would need k — 1 
corners for intersections with R u . . 1 =4,6...., 2k, plus at least two more corners for 
intersections with R x and R y , that is at least k + 1 corners altogether. 

On the other hand, given a feasible 3-coloring of X by colors 2,4,6, a representation 
by fc-gons can be achieved by placing the polygons /?„, , x £ X around the circle 
such that polygons corresponding to vertices of color i are placed within the segment 
determined by the chord R Ui ,i = 2,4,6. Each c vertex can then be represented by 
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a fc-gon with one comer in each segment determined by R Ui , i = 2, 4, 6, , 2k. An 
illustrative example is in Figure 6, where a triangle representing a triple T = {x. y, z} 
is marked. For the sake of simplicity, we only illustrate the case k — 3, and the auxiliary 
polygons Rt, : are not shown (drawn with invisible ink). 

Along the same lines it is seen that cmp(G 7 -) < k + 1 if (A, T) allows a 2-coloring 
without monochromatic triples. 



One may ask for what function f(ri) of n it is still hard to decide if cmp(G) < f(n) 
for an input polygon-circle graph G on n vertices (since we do not trust the polynomiality 
of polygon-circle graph recognition, we need to assume that the input graph comes with 
the promise of being a polygon-circle one). As we have seen in previous sections, /(n) 
must be somewhat smaller than n so that we might expect NP-hardness. By a closer 
examination of the proof of Theorem 2 we see that we can get close to j : 

Theorem 6. For every rational c < it is NP-complete to decide whether cmp(G) < 
cnfor an input graph G on n vertices. 



Proof. Consider the previous construction of Gp. If we start the reduction from 3-edge 
coloring of a graph with 2m vertices and 3 m edges, we get 




Fig. 6. Illustration to the NP-completeness proof. 



n=\V\ = 2k + 2\X\ + \T\ = 2 k + 8m. 
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Hence setting n = (if 1 — 2c > 0 is rational, we may start with sufficiently many 
copies of the cubic graph to make n an integer) we get 

2 k = n — 8 m = 8 m(- — 1) = 2 cn 

and hence our previous proof shows that deciding 

cmp(G 7 -) < k = cn 



is NP-complete. 



6 Conclusion 

Regardless of whether recognition of polygon-circle graphs turns out polynomial or NP- 
complete (if dichotomy is expected), our results about NP-hardness of polygon-circle 
graphs of bounded complicacy will be an interesting counterpart to the complexity of 
recognition of the whole class. We do hope that our paper will revitalize interest in the 
polygon-circle recognition problem as well. 
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Abstract. We prove that a contact system of Jordan arcs is stretchable 
if and only if it is extendable into a weak arrangement of pseudo-lines. 

1 Barycentric Representations of Graphs 

1.1 Tutte’s Barycenter Theorem 

It is a classical result independently established by Wagner [11], Fary [2] and 
Stein [6] (which is also a consequence of the Steinitz’s theorem on convex poly- 
topes [7] ) that any simple planar graph has a straight line representation in the 
plane. In the early 60’s, Tutte [9] [10] gave a way to build a convex embedding 
of a 3-connected planar graph: 

— Let S be the vertex set of a face Fq of G in a planar embedding of G, 

— let / : S — > 1R 2 be a mapping of the vertices in S to the points of a convex 
polygon, in such a way that the order of the points on the polygon is the 
same as the order of the vertices on the face Fq, 

— let F : V(G) \ S — > IR 2 be the solution of the linear system 



Theorem 1 (Tutte’s theorem). F defines an embedding of G into the plane, 
with strictly convex interior faces. 

In his quite long and difficult proof, Tutte simultaneously reproves Kura- 
towski’s planarity criterion. Other proofs of this theorem have been published 
since. This result has been generalized by Thomassen to infinite planar graphs 
in [8], while Linial, Lovasz and Wigderson proved the equivalence of the k ver- 
tex connectivity and the existence of a convex embedding (for any k extremal 
vertices) in general position in lR fc [3]. In Tutte’s construction, each vertex in 
V(G) \ S is the barycenter of its neighbors. Although Tutte assumed all the 
coefficients to be equal, his proof extends without changes when coefficients are 
positive reals. In the aim of studying straight line segments contact systems, we 
shall first give a short proof of an extension of this theorem, in a framework 
allowing null coefficients, giving some side results necessary for our study on 
stretchability. 





{v,x}£E(G) 




(i) 



G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 71-85, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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First, we shall give necessary and sufficient conditions for such a barycentric 
linear system to have a unique solution (without planarity assumptions), and 
for its solution to define a plane embedding (assuming that the graph is planar 
and that the source set is embedded on a convex polygon with a compatible 
point order). Instead of the classic approach consisting in directly proving the 
planarity of the barycentric representation, we will make use of continuity argu- 
ments, while considering an homotopy from some initial straight line embedding 
of a triangulation of the graph (see also [1] [4] for alternative approaches and 
generalizations of Tutte’s Theorem) . 



1.2 Definitions and Notations 

A barycentric system £ on a graph G is a triple (S, /, a), where the source set 
A of A is a subset of vertices of G, the limit function f of £ is a function from 
S to lR fc , the weight function a of A is a function from V(G) x V(G) to [0,1], 
and such that: 

Vx, 3 / £ V(G), a(x,y) yf 0 => {x 7 y} £ E{G) and x $ S ( 2 ) 

Vx £ V(G) \S, a ( x ’ = 1 (3) 

(. x,v)£E(G ) 

A solution of A is a function P : V(G) — > IR fe , such that: 



\P(x) 



f(x), if x £ S' 

Ei>ev(G) a ( x ’ v )'P( v )i otherwise 



(4) 



Remark 1. Notice the following equivalences: 

x £ S a(x,v) = 0 Vx £ 1 f(G), a(x,v) = 0 

(x,v)eE(G) 

Let £ = (S, /, a ) be a barycentric system on a simple graph G. Let A be a 
subset of V(G), the relative source set S(A) of A is defined by: 

S(A) = (S (~l A) U {x £ A, 3v £ A, a(x, v ) yf 0} (5) 

That is, Vx £ A: 

x S(A) •<=>■ a(x,v) = l (6) 

(x,v)eE(G A ) 

Notice that, according to this definition, S(V(G)) = S. 

Lemma 1. Let G be a simple graph, let £ = ( S, /, a ) be a barycentric system 
on G. Assume P is a solution of £ . 
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Let A be a subset of V ( G ) . We define a a : A x A — > [ 0 , 1 ] by: 



Vx, y & A, a A {x,y) 



0, i/i£ S(A) 

a(x,y), otherwise 



Then, £ A = (S(A), F\g(A)i «a) a barycentric system on G A , having F\ A as a 
solution. 



Proof. According to the definition of S(A ) , the function a a is such that: 

Vx,y £ A, a(x,y) 0 => {x,y} £ E(Ga) and x S{A) 

Mx £A\S(A), a(x,v) = l 

( x,v)gE(Ga ) 

Thus, E a is a barycentric system on Ga , obviously having F\a as a solution. □ 

A barycentric system E = {S, /, a) on a simple graph G defines a directed graph 
Ds on the same vertex set as G , having S has its set of sources: 

Mx,y£V(G), (x,y) £ E(D S ) a(y,x) ± 0 (7) 



Let G be a simple graph of order n. Consider any fixed numbering of the 
vertices of G. A barycentric system E = (S, f, a) on G define a n x n square 
matrix A (with coefficients in [0,1]) and a n column matrix F (with coefficients 
in lR fc ) by Aij = a(vi, Vj) and 



Fi = 




if Vi £ S 
otherwise 



So, the solutions of E correspond to the solutions F of the equation T = AF + F. 

Let G be a simple graph, E = ( S,f,a ) a barycentric system on G, and Dg 
the associated directed graph. 

The distance function distg : V(G) — > IN U {oo} is defined as the minimum 
length of a directed path in Ds from S' to a vertex, or oo is such a path does 
not exist. 



1.3 General Resolution 

In the next lemmas, G is assumed to be a simple graph, and E = ( S,f,a ) 
to be barycentric system on G. Elements of lR fc are compared with respect to 
lexicographic order. 

Lemma 2. If sup„ e y( G ) dists(u) < oo, any barycentric function F reaches its 
extremal values on S and, for any vertex x, there exists in D 

— a directed path Pff from S to x such that (F, distg) is strictly increasing on 
P+ (with respect to lexicographic order). 
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— a directed path P x from S to x such that (fL, —dists) is strictly decreasing 
on P~ (with respect to lexicographic order). 

Proof. We only have to prove the existence of P+ as the existence of the directed 
path P~ is then obtained by considering —L r instead of <F, and the monotony of 
these paths implies that the extrema of are reached on S. 

Assume there is at least a vertex v such that such a path P+ does not 
exist, and choose the one with the smallest possible value of (<F(u), dists (u)). 
Obviously, v does not belong to S. So, 



<F(u) 



E 

(w,v)e.E(Dz;) 



a(v, w)d r (w) > 



min I'(w) 

( w,v)eE(D s ) 



— If there exists an arc (x,v) £ E(Ds) such that \P(x) < \L(v), then P+ exists 
(by minimality assumption on v) and the directed path P+ = P+ + (x, v) 
contradicts the hypothesis. 

— Otherwise, any edge (x, v ) £ E(Ds) is such that \P(x) = Hs(v). As there exists 
an edge (x,v) £ E(Ds), such that dists (a;) = dists(u) — 1 the directed path 
P+ exists, and P+ = P+ + (x, v ) contradicts the hypothesis. 

In both cases, we are led to a contradiction. □ 



Lemma 3. If sup w£ y( G ) dists (v) < oo, E has a unique solution <fi. 

Proof. The function (j) exists and is unique if and only if the kernel of A is {0}, 
that is, if and only if the barycentric system ( S , /o, a) has no non zero solution, 
where /o is a constant function mapping S to 0. But this is an implication of 
the previous result, which asserts that the extrema of a solution of a barycentric 
system are reached on its source set, that is that a solution has the same extremal 
values as the limit function. □ 



Lemma 4. If sup„ e y( G ) dists(u) = oo, the barycentric system has an infinite 
set of solutions. 

Proof. Let Vs be the subset of vertices reachable from S' by a directed path. For 
any value c £ IR fe , define the function f : ( V (G)\Vs)US —*■ IR fc by f'(v) = f(v) if 
v £ S, and f'[v) = c, otherwise. According to Lemma 3, the barycentric system 
((V(G) \ Vs) U S, a, f) has a solution </>, which is also a solution of E. This 
solution is such that 4>\v(G)\v s — c j an< ^ thus we get a infinite set of solutions of 
E. □ 

Theorem 2. Let G be a simple graph. Any barycentric system E = (S, /, a) 
has a solution, which is unique if and only if any vertex of the directed graph 
associated with E may be reached from S by a directed path. 

In the latter case, the solution (j) reaches its extremal values on S and, for 
any vertex x, there exists in D z : 
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— a directed path P(f from S to x such that (</>, dists) is strictly increasing on 
Pff (with respect to lexicographic order). 

— a directed path P~ from S to x such that (< j>, — dists) is strictly decreasing 
on P~ (with respect to lexicographic order). 

Proof. This is a direct consequence of the three preceding propositions. □ 



Corollary 1. Let G be a simple graph and let £ = ( 5, /, a ) be a bary centric 
system on G having a unique solution. 

Then, for any subset A of V(G), | 5(A) | is at least equal to the number of 
connected components ofGA- 

Proof. It is sufficient to prove the case where Ga is connected. Let x be an 
element of A. If x belongs to 5, we are done. Otherwise, as £ has a unique 
solution, there exists, according to Theorem 2, a directed path P from S to x. 
This path enters A at vertex v which, by definition, belongs to 5(A). □ 



1.4 Planarity of a Barycentric Representation 

Barycentric representations and convex drawings. Let G be a 2- 

connected planar graph. A straight line drawing of G is a convex drawing if 
each face of the graph is drawn as a convex polygon. The drawing is strictly 
convex if the vertices of a face are mapped to the extremal points of the poly- 
gon corresponding to the face. A barycentric representation of G with source 
set 5 C V{G) is a mapping from V{G) to 1R 2 , defining a straight line planar 
drawing of G, and such that any vertex not in 5 is a barycenter of its neighbors 
(with coefficients in [0,1]). Convex drawings are deeply related to barycentric 
representations : 

Lemma 5. Let G be a 2-connected planar graph. A straight line representation 
of G is convex if and only it is a barycentric representation of G, which source 
set S is the vertex set of a face of G and is embedded on a convex polygon (in a 
compatible order). 

Proof. Consider a convex drawing of G and let 5 be the vertex set of the external 
face. Let v be an internal vertex. As the faces including v are convex, we may 
augment the neighborhood of v to a wheel by adding straight line segments 
between consecutive non-adjacent neighbors while preserving planarity. Thus, v 
belongs to the convex hull of its neighbors. Hence, the drawing is a barycentric 
representation of G with source set 5. 

Conversely, assume 5 is the vertex set of a face of G embedded on a convex 
polygon of the plane. Assume the representation includes a non convex internal 
face, and let V\,V 2 ,V 3 be three consecutive vertices of such a face defining a 
concave angle at V 2 - Then, V 2 does not belong to the convex hull of its neighbors, 
a contradiction. □ 
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Corollary 2. The embedding defined by a convex drawing of a 2-connected pla- 
nar graph G is uniquely determined by the choice of the external face. 



Corollary 3. Let P be a convex polygon, let {p\,P 2 ,Ps) be a triangle including 
P in its interior, let G be the graph induced by P, the triangle and a triangulation 
of the annulus between the triangle and the polygon, let Si,S 2 ,S 3 be the vertices 
of G mapped to Pi,P 2 ,Ps and let g(si) = pi (for 1 < i < 3). 

Then, there exists a function ap : V(G) x 1 /(G) — > [0,1], such that E = 
({si, S 2 5 S 3 },g, ap) is a barycentric system on G with unique solution T mapping 
G to its geometric embedding. A simple geometric argument shows that a may 
be chosen in such a way that a{x, y) > 0, for any pair ( x , y) of points adjacent 
on P. 

Barycentric Representations 

Lemma 6. Let G be a maximal planar graph and let p : [0,1] x V(G) — > 1R 2 
be a mapping such that 

— p(0, ■ ) induces a plane straight line representation of G, 

— p{ ■ , x) is continuous for any vertex x of G, 

— Vf G]0,1] and for all triangle (cci, £ 2 , £ 3 ) ofG, p(t, X\), pit, x^) and pit, x^) 
are not collinear. 

Then, p(t, ■ ) induces a plane straight line representation of G for every t in 

[ 0 , 1 ]- 

Proof. In order to check if a straight line representation of a maximal planar 
graph is plane, it is sufficient to check that no edge has null length, that no two 
adjacent edges are overlapping and that the circular orders of the edges around a 
vertex define a planar map. The first two cases may not occur, as it would imply 
the existence of an aligned triangle. As p{ ■ , x) is continuous for any vertex x, the 
circular order around a vertex may only change if two consecutive edges overlap 
for some t, which would give rise to a contradiction. □ 

Given a weak plane straight line representation of G, a connected subgraph H is 
0 -degenerated if all its vertices are mapped to a same point. It is 1-degenerated 
if all its vertices are mapped to aligned points. A k-degenerated component is a 
maximal /c-degenerated connected subgraph (it is thus an induced subgraph of 
G). 

Lemma 7. Let H be a 2-connected plane graph, and let E = {Sp, fHiOtn) be 
a barycentric system on H having a unique solution Tp, where Sp is the vertex 
set of the outer face of H . 

Assume fp maps Sp to a convex polygon P in compatible order, and that 
the following condition holds for any subset A of vertices inducing a connected 
subgraph: 
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(ii) | S(A) | = 2 => Ha is a path 

Then, there exists a maximal planar graph G and a barycentric system E = 
( S , /, a) having a unique solution, such that: 

— H is a subgraph of G, 

— is the solution of Ep, 

— conditions (i) and (ii) hold for G (replacing Ha by Ga, and considering 
relative source sets according to E) 

Proof. Denote S ^ and S ^ the relative source set functions with respect to 
H, Ep and G, Eq, respectively. 

The graph G\ is obtained by adding a vertex in each interior face of H, 
adjacent to all the vertices of the face. 

Let {pi,P2,P3} be a triangle having the polygon P in its interior, let G2 
be the graph obtained by geometrically triangulating the annulus between the 
triangle and the polygon, let si, S2, S3 be the vertices of G 2 corresponding to the 
Pi,P2,P3, let / be the mapping defined by /(s*) = pi, and let ap be such that 
({si, S2, S3}, /, ap) is a barycentric system on G2 having a unique solution T, 
which restriction to S is fp (such a function exists according to Corollary 3). 

The graph G is obtained from the union of Gi and G 2 , by identifying the 
vertices corresponding to elements of S. 

Then, define a : V(G) x V(G ) -A [0,1] as follows: 



a{x,y) 



(a H {x,y), 

J oip(x, y), 

1 

d(x ) ’ 

,0, 



if x,y G V(H) 
if x,y G V(G 2 ) 

if x 6 V"(Gi) \ V(H) and y G V{H), 
otherwise 



( 8 ) 



By construction, H is a subgraph of G. It is easily checked that E = 
({si, S 2 , S3}, /, a) is a barycentric system on G having a unique solution, which 
restriction to V(H) is a solution of Ep (according to Lemma 1). 

Consider any subset A C V (G) inducing a connected subgraph Ga of G, such 
that | Sg {A) | < 2 and Ha has no cycle. As no vertex x G A \ V ( H ) \ (si , S2 , S3} 
may have all its neighbors in A (for otherwise its neighbors would form a cycle 
in Ha), any vertex in A \ V(H) belongs to S^^A). By definition, for any 
x G S( H \A n V{H)) \ Sp, there exists v G V(H) \ A, ap(x,v) > 0. Thus, 
a(x,v) > 0, what implies x G S’1 G 1(A). If (A) D Sp is not empty, it is easily 
checked that either S ( ' H \A) D Sp is included in (A), or Sp is included 
in A (and hence Ha includes a cycle), or S^ H \A) has cardinality at least 3. 
Altogether, we get: 



S ( ' G \A) = S {h \A n V(H)) U(A\ V(H)) (9) 



Assume there exists a subset A of V (G) inducing a connected subgraph of 



G. 
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If | £( G )(A) | = 1 then, according to (9), either A C V(H) and | S( H \A) | = 
1, or A fl V(H) = 0 and | A |=1. Hence, (i) holds. 

If | S( G \A) | = 2 then, according to (9): 

— either A C V{H) and | S^ H \A) | = 2, and hence Ga = Ha is a path, 

— or A \ V(H) = 1 and | S^ H \A) | = 1, and hence | A | = 2 and Ga is a path, 

— or in V(H) = 0 and A is a subset of size 2 of {si, s 2 , S3} (hence inducing 
a path). 

Thus (ii) holds for G. □ 



Theorem 3. Let G be a 2 -connected plane graph, S the vertex set of its outer 
face, f a function mapping S to the points of a convex polygon in compatible 
order, and a : V(G) x V(G) — > [0,1], such that E = ( S,f,a ) is a bary centric 
system on G with a unique solution T . 

Then, T defines a barycentric representation of G if and only if, for any 
subset A ofV(G) inducing a connected subgraph, the following assertions hold: 



(1) S(A) 
(ii) S{A) 



1 =>\A\ = 1 

2 => Ga is a path 



Proof. First, notice that the necessity of conditions (i) and (ii) is straightforward. 

Assume (i) and (ii) hold. According to Lemma 7, we may assume that G 
is maximal planar and that / maps its outer face {si, 32,53} to a triangle 
{PhP2iP3}- It is a classical result [2,6,11] that G has a straight line embedding 
r with external face {si,S2,S3}. Moreover, using an affine transformation, we 
may assume that si, S2, S3 are mapped to Pi,P2,P3 in this embedding. According 
to Lemma 5, this is a barycentric representation, and thus induced by the solu- 
tion of some barycentric system E' x = ( S,f,a '). Let at = (1 — t)a' + a. Then, 
E^ = (5, /, a t ) is obviously a barycentric system having a unique solution 
continuously depending on t. Let p{t,v) = T^\v). If, for any t € ]0,1], there 
exists no triangle (xi, 22, £3) such that p(t, X\), p{t, x-f) and p (i, 23) are collinear 
then, according to Lemma 6, T = p(l, ■ ) will define a barycentric representation 
of G. First notice that, if 0 < t < 1 then, for any 2 ,y £ V{G), a t (x,y) > 0 if 
and only if a(x,y) > 0 or a'{x,y) > 0. Thus, for any subset A of vertices, the 
relative source set of A computed according to E t includes the relative source 
set of 5'(A) computed according to E. Hence, as (i) and (ii) hold for t = 1, these 
conditions also hold for any t G ] 0 , 1]. Moreover, if conditions (i) and (ii) imply 
that no triangle (21,22,23) of G is such that p(l,Xi), ^(1,22) and ^(1,23) are 
collinear then the same will apply for any t £ ] 0 , 1 [. 

Assume there exists a triangle {21,22,23} of G , such that ^(21), ^(22) and 
$■(23) are collinear. Let Ga Q {21, 2 2 , 23} be the corresponding 1-degenerated 
component, and let ax + by + c = 0 be an equation of the straight line A 
including T(A). Consider the barycentric system Ea = ( S,af x + bf y + c, a) 
(where (f x (v), fyi v )) = /(v)) and its unique solution T. The subgraph Ga is 
then a connected component of •f'~ 1 (0). 
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Let G + be the directed planar graph obtained from G by adding to vertices 
lu~ and lo + and the edges {w“,s} (if s £ S and cp(s) < 0), {w + ,s} (if s £ 
S and </>(s) > 0) and {lo~,uj + }. 

For each vertex ai £ S'(A) \ S, there exists a vertex v such that {v,af) is an 
arc of Dz a and J/{v) ^ 0. From the equation 0 = ^{af) = ai )^E(D s ) 
we deduce that there exists, for each such aj, two vertices Xi and z/^, such that 
(xi,cii) and ( yi,ai ) are arcs of D% A and such that ^(xf) < 0 < J/fyf). According 
to Theorem 2, there exists in D Sa , for each such , directed paths P+ and P~ , 
whose vertices (except af) have a negative (resp. positive) i'-value. 

Thus, there exists, in G + , for each at £ S'(A), paths Pi(ai) and P 2 (aj), from 
uj~ (resp. u> + ) to Oj, which vertices (except a^) have a negative (resp. positive) 
<F-value. Hence, we have: 

V{a,a'} C S{A) P^a) n P 2 (a') = 0 

Va £ S(A), Pi (a) fll = P 2 (a) Hi = Pl (a) H P 2 (a) = {a} 

Va £ S(A), u)~ £ Pi(a) and w + £ P 2 (a) 

Hence, two cases may occur: 

— A = S(A). Then, the triangle {xi,x 2 ,x 3 }, the paths P-,{xj) ( i £ {1,2}, j £ 
{1,2,3}), and the edge {w“,w + } may be contracted to K 5 , contradicting 
the planarity of G + . 

— there exists a vertex x in A \ S(A). According to the 3-connexity of G + , 
there exists 3 disjoint paths from x to w + . By continuity, these paths leave 
A at some distinct vertices ui,u 2 ,U 3 £ S(A). Then, the subpaths from x to 
vi,V 2 ,V 3 , and the paths Pi(vj ) (i £ { 1 , 2 } , j £ {1,2,3}) may be contracted 
to a A 3 , 3 , contradicting the planarity of G + . 

In both cases, we are led to a contradiction. Thus, no triangle of G may be 
mapped to 3 collinear points. Hence, is a barycentric representation of G. □ 



2 Jordan Arc Contact Systems 

2.1 Introduction 

A Jordan arc is an arc homeomorphic to a straight line segment. A pseudo-line 
is an arc homeomorphic to a straight line. A contact system is a set of Jordan 
arcs in the plane, such that two arcs intersect at most once at a point which 
is internal to at most one arc. A contact system is stretchable if there exits 
an homeomorphism which transforms it into a contact system whose arcs are 
straight line segments. 

A set £ = (Li, . . . , Lfc) of pseudo-lines is a weak arrangement of pseudo-lines 
if it satisfies : 



V* 7^ j , 



Li n Lj | < 1 



( 10 ) 
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A weak arrangement of pseudolines is an arrangement of pseudolines if it satisfies 
the additional axiom : 

Vi, j, k, (Li (1 Lj =9 and (~l Lk yf 0) => Lj D Lk ^ 0 

A contact system A = (A±,...,Ak) is extendable into an arrangement of 
pseudo-lines (resp. into a weak arrangement of pseudolines) if there exists a 
arrangement of pseudo-lines (resp. a weak arrangement of pseudolines) £ = 
(Li, . . . , Lk), such that A* C for each i e {1, . . . , k}. 

Remark 2. It is easy to prove that a contact system is extendable into an ar- 
rangement of pseudo-lines if and only if it is extendable into a weak arrangement 
of pseudo- lines. 

2.2 Extendibility and Extremal Points 

An extremal point of a contact system is a point of the union of the arcs which 
is interior to no arc. We note 5(A) the set of the extremal points of the contact 
system A. A maximal contact system is a contact system whose extremal points 
belong to the unbounded region. A contact system A = (Ax, ■ ■ ■ , Ak) is extend- 
able into a maximal contact system if there exists a maximal contact system 
A! = (A'x , . . . , A' k ), such that A , C A! i for each i e {1, . . . , k}. 

Remark 3. If a contact system is extendable into a weak arrangement of pseudo- 
lines, it is extendable into a maximal contact system, which in turn is extendable 
into a weak arrangement of pseudo-lines. 

Let A be a maximal contact system extendable into a weak arrangement of 
pseudo-lines C. 

— Let p be an interior point of an arc A (but no other arc) on the unbounded 
region of A. 

— Let L be the pseudoline extending A in £. and let L~ and L + be the two 
halves of L delimited by p. 

— Let H be a half pseudo-line having its endpoint at p and having no other 
intersection in A. 

Then, L and H induce a “partition” of A into three new contact systems A~, A + 
and A a (see Fig. 1): 

— H,L~ and L + define three unbounded regions 1Z~ ,1Z + and 1Z° having H U 
L ~ , H U L + and L as respective frontiers. 

— We define J ~ , J + and J° as the sub-arcs of H U L ~ , H U L + and L strictly 
including all the intersections of these later pseudolines with A. 

These regions define three contact systems of Jordan arcs : 

A” ={ J~} U {A fl 7 Z~ , A € A and A D 1Z~ is a nonempty arc} 

A + ={=/ + } U {A fl 1Z + , A € A and A fl 1Z + is a nonempty arc} 
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Fig. 1 . Partitioning a Contact System 



*4° ={ J 0 } U{Jn 1Z°, A £ A and A fi 7Z° is a nonempty arc} 



These three contact systems have some nice properties : 

Lemma 8. The contact systems and A 0 are maximal and extendable 

into weak arrangements of pseudo-lines. 

Proof. The maximality is straightforward, as the extremal points of these contact 
systems are either extremal points of A or the extremities of J ~ , J + or J° . 

The contact systems A~ and A + are homeomorph to the contact systems 
where J~ (resp. J + ) is replaced by J°. Then, the weak arrangement of pseudo- 
lines which extends A is an extension of all of the contact systems. □ 

Lemma 9. If the arc A includes no extremal point of A. we have : 

2<\A~\<\A\, 2 < | A + | < | -4 | , 3 < | -4° | < | .4 | , 

and each of A ~ , A + and A 0 has two extremal points belonging to a same arc. 

Proof. As A includes no extremal point of A the extremity of A in A~ (resp. 
A + ) is interior to some arc B~ (resp. B + ). As A may not intersect an arc twice, 
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we have B~ ^ B + . Moreover, each of these arcs intersects at most once the 
extension L of A, and hence the arc B~ (resp. B + ) does not meet the region 1Z + 
(resp. TZ~). Thus, | A + | < | A | — 1 and the same holds for | A | . Notice also 
that A~ and A + include at least two arcs : J~ (resp. J + ) with two extremal 
points on it and the non empty portion of B~ (resp. B + ) in 7 Z~ (resp. TZ + ). 
Moreover, A 0 includes at least three arcs : J° (with two extremal points on it) 
and the portions of B~ and B + in 1Z°. □ 



Lemma 10. Any contact system extendable into an arrangement of pseudo- 
lines has at least 3 extremal points, unless it has cardinality at most one. 

Proof. First notice that we may prove the lemma for maximal contact systems 
only, according Remark 3 and the fact that the extended contact system may 
not have more extremal points than the original one. 

The lemma is straightforward for contact systems of cardinality at most 
two. So, assume that for any 2 < i < k, the lemma holds for maximal contact 
systems of cardinality i and assume there exists a maximal contact system A of 
cardinality k having at most 2 extremal points. 

Notice that at least 3 arcs of A meet the unbounded region in a sub-arc. 
Otherwise, the contact system would be bounded by a single arc or by two arcs 
intersecting each other twice, and the system could not be extended into a weak 
arrangement of pseudo-lines. 

— Assume A has two extremal points belonging to a same arc. 

From the previous construction and according to Lemma 8, A~ and A + are 
extendable into weak arrangements of pseudo-line and, according to Lemma 
9, have a smaller cardinality than A but include at least two arcs. By induc- 
tion, they have at least 3 extremal points. Among these, there is one which 
is an extremal point of A belonging to the interior of 1Z~ (resp. 1Z + ). Notice 
that these extremal points do not belong to a same arc : otherwise, this arc 
would intersect A twice. Hence, as the contact system A has an arc with 
two extremal points on it, it has at least 3 extremal points, what leads to a 
contradiction. 

Hence, the lemma holds for any maximal contact system with at most k arcs 
having two extremal points belonging to a same arc. 

— Assume A does not include two extremal points belonging to a same arc. 
From the same construction and according to Lemma 8, the contact system 
A 0 is maximal and, according to Lemma 9, it has cardinality at most k and 
has two extremal points belonging to a same arc. According to the previous 
case, it has at least 3 extremal points. Among these, one is an extremal point 
of A belonging to the interior of 1Z° . Together with the 2 extremal points of 
A belonging to the interior of 7 Z~ and 1Z + (as in the previous case), we get 
3 extremal points, which lead to a contradiction. 

□ 
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Lemma 11. Any contact system A extendable into a weak arrangement of 
pseudo-lines is extendable into a maximal contact system, each subsystem of 
which has at least 3 extremal points unless it has cardinality at most one. 

Proof. As any subsystem A! of the extension of A into a maximal contact system 
is extendable into a weak arrangement of pseudo-lines, the result follows from 
Lemma 10. □ 



2.3 Stretching 

Consider a maximal contact system of Jordan arcs A, and the plane graph G 
whose vertices are the points which are extremity of a least an arc of A and 
whose edges are the sub-arcs between such points. 

Let S be the set of the vertices of G corresponding to points which are interior 
to no arc in A. 

Let / be a function mapping S to the vertices of a convex polygon, in com- 
patible order. 

Let a : V ( G ) x V ( G ) — > [ 0 , 1 ] be a function such that: 

— a(x, y) = 0 if and only if y is interior to no arc in A or x and y are non 
adjacent in G. 

— a(xi ,y) + a(x 2 , y) = 1 if y is interior to some arc A £ A and has X\ and X 2 
as neighbors in A. 

Then, S = (S, /, a) is a barycentric system on G. 

Notation 1. Given an arc A £ A, the path P(A) is the path of G, whose vertices 
and edges are induced by A. The set V of these paths induces a covering of G by 
edge disjoint paths. 

By extension, qiven a contact system A ' , P(A') denotes the union of the 
paths P{A) (A £ A'). 



Notation 2. Let X C V(G). We note A(X) the set of the arcs of A having at 
least two points corresponding to vertices in X. 

We now may link Theorem 3 with the stretching problem, using the next 
notations and lemmas. 

Lemma 12. Let A' be a subsystem of A. Then S(V(P(A'))) = 5{A'). 

Proof. A vertex of X = V(P(A')) belongs to the S(X) if and only if it has no 
incoming edge in D % (hence is an extremal point of A) or if its has an incoming 
edge in Ds from a vertex which does not belong to X (and thus is internal 
to some arc in A \ A' or, equivalently, is an extremal point of A' but not an 
extremal point of A). Thus, S(X) is the set of the extremal vertices of A' . □ 
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Lemma 13. Let X C V(G) have cardinality at least 2 and induce a connected 
subgraph of G. Then, 



X CV(P(A(X))) (11) 

\S{X)\ >|S(y(P(.4(X))))| (12) 

Proof. Inclusion (11) is straightforward as each edge of Gx is included in the 
path corresponding to some arc of A(X). 

Consider the connected graph H obtained from Gx by adding k > 0 paths 
corresponding to arcs in A(X) and assume that | S(V(H)) | = | S(X) | holds. 
Let A be some arc of A(X), such that P(A) does not belong to H. 

— Assume H include two vertices v and w of P(A) but none of the edges of P 
linking them. 

Then, by adding those missing edges and vertices, no vertex is added to 

S(V(H)). 

After this first completion, the internal vertices of P(A) have at most one 
incoming edge from the complement of H. 

— Otherwise, H includes an interior vertex v of P(A ), but none of the vertices 
between v and an extremity w of P. 

Then, v belongs to S(V(H)). By adding the sub-path from v to w, exactly 
one vertex is added to the S(V(H)), namely w. As P(A) has at least two 
vertices in common with H , v is an interior vertex of P(A). Moreover, as 
the previous case does not apply, v had initially one incoming edge (exactly) 
from the complement of H' . Thus, after the addition of the sub-path from v 
to w, the vertex v will no longer belong to S(V(H)). 

Thus, we may add a new path to H without increasing | S(V(H)) |, which com- 
pletes the induction. □ 



Theorem 4. Let A be a contact system of Jordan arcs. Then, the following 
conditions are equivalent : 

(Ci) A is stretchable, 

( C 2 ) A is extendable into a weak arrangement of pseudo-lines, 

Proof. It is straightforward that Condition (C\) implies condition (C 2 ). Accord- 
ing to Lemma 11, this second condition implies that A is extendable into a 
maximal contact system, each subsystem of which has at least 3 extremal points 
or has cardinality at most one. Consider any subset X of vertices of G with 
cardinality at least 2 and inducing a connected subgraph of G. Then, according 
to Lemma 13 and Lemma 12, | S(X) | > | S(V(P(A(X)))) \ = | S(A(X)) |. But, 
either A(X) includes a single arc and hence Gx is a path and | S(X) | > 2 or 
A(X) includes at least two arcs and then | S(X) | > 3, according to the hypoth- 
esis. The result now follows from the application of Theorem 3. □ 
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3 Conclusion 

The equivalence between stretchability and extendibility into an arrangement of 
pseudo-lines for contact systems of Jordan arcs does not extend to intersection 
systems (stretchability of pseudoline arrangements is NP-hard, as proved by 
Short [5]). With more work, another equivalent condition may be given : A 
contact system is stretchable if and only if any subsystem has at least 3 extremal 
points on its unbounded region, unless it has at most an arc. 
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Abstract. A geometric graph is a graph embedded in the plane in such a way that 
vertices correspond to points in general position and edges correspond to segments 
connecting the appropriate points. A noncrossing Hamiltonian path in a geometric 
graph is a Hamiltonian path which does not contain any intersecting pair of edges. 
In the paper, we study a problem asked by Micha Perles: Determine a function 
h, where h(n) is the largest number k such that when we remove arbitrary set 
of k edges from a complete geometric graph on n vertices, the resulting graph 
still has a noncrossing Hamiltonian path. We prove that h(n) = f2(y/n). We also 
determine the function exactly in case when the removed edges form a star or a 
matching, and give asymptotically tight bounds in case they form a clique. 



1 Introduction 

A geometric graph is a graph drawn in the plane so that its vertices are represented 
by points in general position (i.e., there are no three collinear points) and its edges are 
straight-line segments connecting the corresponding vertices. 

Lately geometric graphs have been intensively studied. There are many papers study- 
ing the smallest number of edges needed to guarantee the occurence of some fixed 
subconfiguration in any geometric graph (the trivial result of this type following from 
Euler’s polyhedral formula is that any geometric graph with at least 3n — 5 edges must 
have two edges which intersect). Interesting results of this sort may be found in [1], [4], 
[5], [6], [7] to name a few or in the surveys on geometric graphs ([2], [3]). 

In our paper we study the existence of a noncrossing Hamiltonian path (i.e. Hamil- 
tonian path which does not cross itself) in a given geometric graph. In particular we 
concentrate on a problem presented by Micha Perles on DIMACS Workshop on Ge- 
ometric Graph Theory in 2002, which asks to determine the largest possible number 
h(n ) such that every geometric graph on n vertices with at least Q) — h(n) edges has 
a noncrossing Hamiltonian path. Apart from improving the asymptotic bounds of h{n), 
we also focus on the restriction of the problem to some classes of geometric graphs. Let 
h\{n) denote the largest k such that when we remove the edges of an arbitrary complete 
subgraph of size at most k from a complete geometric graph on n vertices, the resulting 
graph always has a noncrossing Hamiltonian path. We prove that there exist constants 
0 < Ci < C 2 such that Ci • i Jn < hi ( n ) < Ci ■ y/n (Theorems 1 and 3). Let /12 (n) denote 
the largest k such that when we remove an arbitrary star with at most k edges from 
a complete geometric graph on n vertices, the resulting graph still has a noncrossing 
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Hamiltonian path. In Theorem 6 we show that Ii 2 (n) = \n/ 2] — 1. We also prove that 
hz(n) = \n/ 2] — 1, where /i 3 (ro) is the largest k such that when we remove at most 
k arbitrary edges from the complete geometric graph on n vertices in convex position, 
then the graph still has a noncrossing Hamiltonian path (Theorems 4 and 5). Further 
we prove that when we remove any matching from a complete geometric graph, the 
resulting graph has a noncrossing Hamiltonian path (Theorem 2). 

The paper is organized as follows: In section 2 we introduce basic definitions and 
notation. In sections 3 and 4 we study complete geometric graphs with removed complete 
subgraph and we prove the asymptotically tight bounds on the size of complete subgraph 
removed. In section 5 we prove the tight bounds on the number of edges removed from 
convex geometric graph and in section 6 we prove the tight bounds on the size of a star 
removed from a complete geometric graph. 

2 Definitions and Notation 

In this section we introduce basic definitions and notation used throughout this paper. 
A geometric graph G is an ordered pair (V, E) where V is a set of points in general 
position in the plane (called vertices of G ) and E is a set of straight-line segments 
connecting two vertices (called edges of G). A Hamiltonian path in a graph G is a 
path contained in G which visits all the vertices of G. A noncrossing Hamiltonian 
path in a geometric graph G = (V, E) is a Hamiltonian path which does not intersect 
itself. A convex hull of a set of points X C R 2 ,X = {.x'-| , . . . , x n } is a set of points 
H = {h £ R 2 : 3ai, . . . a n such that \/i € {1, . . . , n} Oj £ R,cti > 0, Y^=i a i = 
1 and h = Xw=i a i x i}- We say that a point p lies below a line l (line l must not be 
parallel to y-axis) if it lies in the half-plane defined by l which contains -oo on the 
y axis. Similarly we use the term above a line. A point u lies to the left ofv if the x- 
coordinate of u is less than or equal to the x-coordinate of v. Similarly we define that u is 
to the right ofv. Let s be a segment in R 2 defined by two endpoints u,v £ R 2 , where u is 
to the left of v. We say that a point p £ R 2 lies below the segment s if it lies below a line 
defined by u and v, to the right of u and to the left of v. Let Z C R 2 ,Z = { Zl ,...,z n } 
be a set of points in the plane. An x-monotone order of Z is an ordering of Z in which 
the x-coordi nates of the points form a monotone sequence. Analogously we define a 
y-monotone order of Z. A point z £ Z is called an extremal point of Z if it belongs to 
the boundary of the convex hull of Z. A segment uv, with u and v in Z, is called an 
extremal segment of Z if it is a subset of the boundary of the convex hull of Z. 

3 The Lower Bound for Complements of Cliques 

In the following two sections we consider a particular class C of geometric graphs — 
the complements of complete subgraphs. A geometric graph G = (V, E) is in C iff 
there exist X,Y C.V such that V = lUf, X (lY = 0 and E is the set of all the 
possible edges with at least one enpoint in Y (i.e. G is obtained from a complete graph 
by removing the edges of a complete subgraph). We prove that there exists a constant 
ci > 0 such that any geometric graph G £ C with \X\ < C\ ■ sJ\V\ has a noncrossing 
Hamiltonian path. 
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Lemma 1. Let G = ( V. E) be a geometric graph, G £ C. If there exists a line l such 
that all the vertices from X are in one half-plane defined by l and at least |X| vertices 
from Y are in the other half-plane, then there exists a noncrossing Hamiltonian path in 

G. 

Proof. We may WLOG assume that the line l is parallel to the //-axis and that all the 
vertices from the set X are in the left half-plane. We iterate the procedure described 
below to find a noncrossing Hamiltonian path in G. In each step, the procedure adds 
a new vertex to the path that it has constructed in the previous steps, according to the 
following rules: first, we take the upper extremal segment of the vertices not yet added 
to the path (the whole set V in the beginning) which crosses l (see Figure 1 on the left). 
If the last vertex added to the path or the left endpoint of the extremal segment is from 
Y, then the left endpoint of the segment is added to the path (Figure 1 on the right), 
otherwise we add the right endpoint (Figure 2 on the left). If no vertices were added yet, 
we may add either of the endpoints. If l does not cross the convex hull, then we simply 
add the remaining vertices to the path in an ./'-monotone order. See Figure 2 on the right 
for an example of the path produced by this algorithm. 

It is clear that the algorithm finishes when it adds all the vertices to the path. It is also 
easy to see that there are no two consecutive vertices from X on the constructed path 
and hence it is indeed a path in G. The only place in the algorithm, where two vertices 
from X might be added consecutively to the path, is when the convex hull of the vertices 
not yet on the path no longer intersects l, all these remaining vertices lie to the left of l, 
and the algorithm is adding them in an .i'-monotone order. But at that time there is at 
most one vertex from X not on the path: the other vertices from X were added in the 
previous steps because there are at least |X| vertices from Y in the right half-plane, and 
each of them has been added to the path immediately after a vertex from Y, with the 
possible exception of the first vertex of the path. 

It remains to prove that the path is noncrossing. We check that after each step of the 
algorithm, the path does not intersect the convex hull of the remaining vertices including 
the vertex just added to the path. From this it is obvious that the path does not intersect 
itself (at each vertex each of the following edges of the path must lie in the convex hull 
and the previous edges lie outside of it). When the path contains only one vertex, the 
claim is obviously true. When we add a new vertex to the path, the edge connecting the 
new vertex to the previous vertex on the path cannot intersect the convex hull of the 
remaining vertices — if the edge intersected the convex hull, then the previous vertex on 
the path would have to lie in the lower half-plane defined by the upper extremal segment 
of the remaining vertices intersecting l (see Figure 3). But then we get contradiction 
with the choice of the vertex in the previous step of the algorithm (the vertex cannot be 
an endpoint of the upper extremal segment intersecting Z). From the induction we know 
that no other edge of the path can intersect the convex hull of the remainig vertices and 
so we have proven that the path does not intersect itself. □ 

Now we prove a similar result for a general choice of the sets X and Y : 

Theorem 1. Let G = ( V. , E) be a geometric graph, G £ C. If \Y\ > 2 • \X \ ■ (|X| + 1) 
then there exists a noncrossing Hamiltonian path in G. 
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Fig. 1. The left figure shows the first step of the algorithm. The right figure shows the second step 
of the algorithm — add left end of the segment to the path. 





Fig. 2. On the left is the third step of the algorithm — add right end of the segment to the path. 
On the right is the whole noncrossing Hamiltonian path. 




Fig. 3. If the segment to new vertex intersected the convex hull of the remaining vertices the 
previous vertex on the path would have to lie in the lower half-plane. 
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Fig. 4. Partitioning of the plane into the strips and choice of the vertices to which Lemma 1 should 
be applied. 



Proof. We can WLOG assume that there are no two vertices from V with the same 
x coordinate. Now consider the partitioning of the plane into |W| + 1 vertical strips 
separated by vertical lines passing through the points of X (see Figure 4). From the 
pigeonhole principle it follows that there is a strip S with at least 2 • |X| vertices from Y 
in it. Let xi denote the number of vertices from X to the left of S (including the vertex 
on the left boundary of S). Similarly let x r denote the number of vertices of X to the 
right of S. Now we can certainly choose a vertex z from S such that there are at least 
2 • xi vertices in S to the left from z and at least 2 • x r vertices in S to the right from z. 
The vertical line passing through z splits S into two strips, denoted Si (the left one) and 
S r (the right one). 

We now describe a procedure to find a noncrossing path starting in z and containing 
all the vertices to the left of z. We find lines 1 1 , I 2 such that z 6 h (T I 2 , both l\ and I 2 
contain some vertex lying to the left from Si and there is no vertex lying to the left from 
Si which would lie above l\ or below 1 2 (see Figure 4). It is clear that in Si there are 
either at least xi vertices below (1 or at least xi vertices above ( 2 - Lets WLOG assume 
that the former is true. Let Z denote the set of vertices lying to the left from z and 
below li, including the two vertices on the line l -\ . Now we can apply Lemma 1 to the 
set Z (line l from the statement of the lemma is the left boundary of the strip S). From 
the lemma we get a noncrossing path starting in z containing all the vertices from Z 
(note that z is an endpoint of the upper extremal segment of Z intersecting l). Because 
there are no vertices from Z above the first segment of the path (all the vertices must 
lie below a line defined by the extremal segment from the second step of the procedure 
from Lemma 1 and the whole area above the first segment of the path lies above this line 
— see Figure 5), we can replace the first segment of the path by the path going through 
all the vertices in Si above the line l \ in an ./--monotone order (see Figure 6). By the 
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replacing we got a noncrossing path starting in z and using all the vertices of V to the 
left of z. Similarly we can get a path starting from z and containing all the vertices of V 
to the right of z, then join these two paths in z and get a noncrossing Hamiltonian path 
for G. □ 




Fig. 5. There are no vertices lying above the first segment of the path and below the line fa 




Fig. 6. Replacing the first segment of the path with an 2>monotone path 



Now using the Theorem 1 we can prove the following corollary: 

Corollary 1. Let G = (V. E) be a geometric graph and let k = V) ~ I-®|) k 

is the number of edges not present in G). If 2 • 2 • k ■ (2 • k + 1) < | Vj — 2 ■ k then there 
exists a noncrossing Hamiltonian path in G. 

Proof The idea of the proof is easy. We just use Theorem 1 for the original graph with 
removed edges of the complete subgraph containing all the edges missing in G. 

More formally, let X be the set of all vertices from V with degree less than | Vj — 1. 
The size of X is clearly less than or equal to 2 ■ k. Let Y = V\ X. From the statement 
of the corollary we know that |Y| >2-|X|-(|X| + l) and hence the assumptions from 
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the statement of Theorem 1 are satisfied and we can conclude that G has a noncrossing 
Hamiltonian path. This gives a lower bound of order Q(y/n) for the function h(n) 
defined in the introduction. □ 

Using the algorithm with a similar idea as the algorithm in Lemma 1 we can also 
prove the following result for the complements of matchings: 

Theorem 2. Let G = ( V, E) be a geometric graph which is a complement of a matching 
(i.e. a graph with minimum degree \V\ — 2 ) and \V\ > 3. Then G has a noncrossing 
Hamiltonian path. 

Proof. The case when \ V\ = 3 is trivial so we can assume \ V\ > 4. We use the following 
algorithm for a construction of a noncrossing Hamiltonian path: first take any extremal 
point of V to be the first vertex of the path. Let x be the last vertex on the path constructed 
so far. Then at each step we choose a vertex y which is an extremal point of the remaining 
vertices such that { x , y) is an edge in G which does not intersect the convex hull of the 
remaining points, and we add y to the path. The vertex y with the desired properties 
always exists if there are at least two remaining vertices. Let H' denote the set of points 
outside the path in the previous step of the algorithm and H the set of points outside the 
path in the current step. In the previous step, x was an extremal vertex of //' and so in 
the current situation there must be some segment yiy2 which is an extremal segment of 
H, such that neither of segments xy\ and xy 2 do intersect the convex hull of //. Because 
at least one of these segments must be an edge in G (its complement was a matching) 
we have just proven the existence of y. 

If there is only one remaining vertex and it is not connected by an edge to the last 
point on the path, we cannot finish the path. Let y\ be the last remaining vertex and y 2 . y$ . 
and j/4 be the last vertices on the path in the reverse order (remember that \V\ > 4). In 
this situation we remove the vertices y 2 and 2/3 from the path to get the situation from 
figure 7. Because 2/1 and 2/2 are not connected by an edge of G, we know that 2/4 must 
be connected by an edge to both 2/1 and y 2 and one of these edges does not intersect the 
convex hull of the remaining points. So we can WLOG add 2/1 to the path and then finish 
the path by adding 2/3 and y 2 . □ 




Fig. 7. The situation when we cannot add the last point to the noncrossing path. 
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4 The Upper Bound 



In this section we prove that there exist geometric graphs in C such that the size of X is 
0{y/\V\) and the graphs do not have a noncrossing Hamiltonian path. By proving this 
we get two asymptotically tight bounds on the function hi ( n ) defined in the introduction. 

Lemma 2. Let G = ( V. E) be a geometric graph such that all the vertices ofV are in 
the convex position. Let P = (vi ± , . . . , V{ n ) be a noncrossing Hamiltonian path in G. 
Then for any j £ {1, . . . , n — 2} it holds that from the three vertices Vi , , Vi j+1 , Vi j+2 at 
least two are neighbors on the boundary of the convex hull ofV (i.e. at least two of them 
are connected by an extremal segment ofV). 

Proof Assume that for some j the statement does not hold. Then the vertices on the 
convex hull are split into three nonempty contiguous blocks of neighbouring vertices. 
But the noncrossing path can enter only two of these three parts (once the path enters 
some part it cannot leave it without crossing itself) and hence such a path cannot be 
Hamiltonian. □ 



Theorem 3. For each no £ N there exists n £ N, no < n such that there is a geometric 
graph G = (V, E),G £ C, | V| = n satisfying |X| < 3 • yfn without a noncrossing 
Hamiltonian path. 

Proof. Lets have some n 0 . Let n be the smallest natural number greater than n 0 which 
is the square of some natural number. Now we describe a geometric graph on n vertices 
with the desired properties. We place n vertices of the graph on a circle with equal 
distances between neighbouring vertices. Then we split the vertices into yfn groups 
(each of size yfn) in such a way that each group forms a contiguous sequence on the 
circle. Now we define the partitioning of V into X and Y (and by this we determine 
the edges of the graph). We choose arbitrarily one group (let us call it g) and put all 
its vertices to X. In the other groups we put two vertices on the borders to X and the 
remaining points to Y (see Figure 8). Clearly |X| = ( yfn — 1) • 2 + yfn < 3 • yfn, so it 
only remains to prove that the graph does not have a noncrossing Hamiltonian path. 

Lets consider the first vertex u in g. Because u £ X, it must be connected to some 
vertex v from Y by an edge of the path. We write g' for the group containing the vertex v. 
Because both neighbors of u on the convex hull are also from X, v cannot be a neighbor 
of it. From this we can also trivially conclude that neither u nor v can be the endpoints of 
the Hamiltonian path. Now we focus our attention on the part of the path which should 
contain all the remaining vertices of g. From Lemma 2 we know that the path must go 
from v either to some other vertex of g' or to the neighbor of u in g. From any vertex of 
g the path must return back to g' to the neighbor of the vertex last used in g. From this 
we get that on the noncrossing Hamiltonian path there must be an alternation of vertices 
from g and from g' in such a way that between any two vertices from g there must be 
a vertex from g' D Y. But we have yfn vertices from g and only yfn — 2 vertices from 
g' fl Y and so this is impossible. □ 
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Fig. 8. Construction of a graph on n vertices with |A'| = 0(- v /n) without a noncrossing Hamil- 
tonian path 



5 Vertices in Convex Position 

In the following section we consider a class T> of convex geometric graphs. A geometric 
graph G = (V, E ) is in V iff the vertices of G are in a convex position. We show that if 
we remove |~| V|/2"| — 1 edges from the complete geometric graph then the noncrossing 
Hamiltonian path still exists (Theorem 4) but if we remove |~|V|/2] edges it need not 
exist (Theorem 5). Note that the bounds are tight. 

Theorem 4. Let G = (V, E) be a geometric graph, G € V, n = \V\. Let G = (V, F) 
be the complement ofG. If\F\ < [§] — 1 then there exists a noncrossing Hamiltonian 
path in G. 

Proof. Let vq, v±, . . . , v n — i be the vertices of G in clockwise order, starting with an 
arbitrary one. Consider the complete geometric graph G' = (V, EJF). Let Pi be the 
path Vi, |-i, Vi- 1 , Vi+ 2 , Vi- 2 , . . . (counting the indices modulo n) in G' . We observe 
that the paths Pi, ... , P\ n / 2 J are pairwise disjoint noncrossing Hamiltonian paths in G' . 
Since |.F| < [|] - 1 we are done for n even — at least one of the paths must avoid 
F and hence it is a noncrossing Hamiltonian path in G. If n is odd we observe that 
there are |n/2j edges {v 0 ,v n - \},{v\,v n - 2 }, ■ ■ ■ , {«|n/2J-i> v \n/ 2 \+i} which are not 
in any Pi. Let A denote this set of edges. We can assume WLOG that the set V forms the 
vertex set of a regular convex n-gon. Observe that every edge of G' can be mapped to 
an edge of A by rotating G' along its centre of rotational symmetry. So we can WLOG 
assume that at least one of the edges of F is in A (and hence is not in any of the paths 
Pi). Now we can conclude using the same argument as for n even that one of the Pi s is 
a noncrossing Hamiltonian path in G. □ 

Theorem 5. For each n,n > 2 there exists a geometric graph G n = ( V n . E n ) such that 
G n £ T>, \V n \ = n, \E n \ = Q)— [tt] and G n does not have a noncrossing Hamiltonian 
path. 

Proof. Let vo, V\, ..., v n -± be the vertices of G n in clockwise order, starting with 
an arbitrary one. First we make an easy observation: the first (and the last) edge of a 
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noncrossing Hamiltonian path P is an extremal segment of V. Consequently, if v+Vj is 
an edge of such a path but not an extremal segment, then P contains at least one extremal 
segment from each of the intervals t\ . . . v 3 and v 3 . . ,v t . 

Let k = [§ ]. We choose F n = {{u 0 , ui}, {ui, v 2 }, ■ ■ ■ , {v k -i, v k }} and E n as 
the complement of F n . Let B = {uq. .... Vk}. Suppose there exists a noncrossing 
Hamiltonian path P avoiding F n . No edge in P may join two points of B, as then by 
the observation above it would have to contain an edge from F n . Therefore B is an 
independent set of P, which is impossible as the largest independent set of P is of size 
k. □ 




Fig. 9. Construction of a noncrossing Hamiltonian path. There are either two uncovered vertices 
in one of the cones (on the left) or every cone contains exactly one uncovered vertex (on the right). 




Fig. 10. Complement of a star without a noncrossing Hamiltonian path. 



6 Complement of Star 

In this section we consider a class of geometric graphs S. A geometric graph G = {V. E) 
is in S iff E is a complement of F, where F is the edge set of a star K-\ j : . We prove 
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that for k < |jVj/2] — 1 there always exists a noncrossing Hamiltonian path but for 
k > [|V|/2] it need not exist. 

Theorem 6. For any geometric graph G = ( V. E) on n vertices, G € S such that 
|.F| < [f] — 1 there exists a noncrossing Hamiltonian path in G. For any n,n > 2 
there exists a geometric graph on n vertices G n , G n £ S with |.F n | = [^] such that 
there is no noncrossing Hamiltonian path in G n . 

Proof. Let C be the center of a star F, where F has at most [f ] — 1 edges. We partition 
the plane into cones by extending the edges of F into rays starting in C. If there is a 
cone that contains at least 2 vertices that are not covered by F, we use the construction 
from Figure 9 (left). Otherwise there must be exactly one vertex in each of the cones. At 
most one of the cones spans an angle greater than straight, let x be the vertex inside this 
cone if such a cone exists, otherwise let x be an arbitrary vertex not covered by F. The 
half-line starting in C passing through x splits its cone into two, so at least one of them 
spans an angle smaller than straight. Now we use the construction from Figure 9 (right). 

We proceed to prove the second claim of the theorem. Let V n and F n look as in 
Figure 10. Note that the boundary of the convex hull of V n contains the vertex C, two 
edges of F n , and all the vertices not covered by F n . Moreover, every cone defined by 
the rays extending the edges of F n contains at most one vertex not covered by F n . 

Suppose there is a noncrossing Hamiltonian path P. If C is the first vertex of P, 
then the second vertex of P is one of the uncovered ones, the third one belongs to one 
of the half-planes determined by the first edge and P cannot get to the other half-plane 
without intersecting its first edge, so the path cannot be Hamiltonian. Similarly, if C is 
an interior vertex of P, the edges of P adjacent to it split the remaining vertices into 
three nonempty parts, and P cannot cover more than two of them without intersecting 
itself. □ 



Conclusion 

In this paper we proved that for several classes of geometric graphs (graphs with vertices 
in convex position, complements of stars, complements of cliques and complements of 
matchings) we need to remove J?(n) edges from a complete geometric graph to forbid 
every noncrossing Hamiltonian path. This lets us conjecture that h(n ) = ()(n) where 
h(n ) is the largest k such that when we remove arbitrary set of k edges from a complete 
geometric graph on n vertices, the resulting graph still has a noncrossing Hamiltonian 
path. 

Acknowledgement. The authors would like to thank Jan Kratochvfl and Pavel Valtr 
who brought the problem to our attention, discussed it with us and helped us with the 
preparation of this paper. 
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Abstract. GraphAEL extracts three types of evolving graphs from the 
Graph Drawing literature and creates 2D and 3D animations of the evolu- 
tions. We study citation graphs, topic graphs, and collaboration graphs. 
We also create difference graphs which capture the nature of change 
between two given time periods. GraphAEL can be accessed online at 
http : / / graphael . cs . arizona . edu. 



1 Introduction 

While static graphs arise in many applications, dynamic processes give rise 
to graphs that evolve through time. Such dynamic processes can be found in 
software engineering, internet /telecommunications traffic, and social networks, 
among others. Typically, the underlying graph structures are large, and depend- 
ing on the time-granularity, may contain from a few to a large number of times- 
lices. The graph representing a particular timeslice may contain fewer/more ver- 
tices/edges than the one representing the previous timeslice, or it may differ in 
some other graph attributes, such as node-weights, edge-weights, or labels. 

We describe algorithms and techniques for visualization of dynamic graph 
processes, using the evolution of the graph drawing literature as an example of 
such a process. GraphAEL is composed of three distinct components: 

1. a relational database that stores and catalogs the data; 

2. a graphical user interface (GUI) for querying the database and for graph 
creation; 

3. a toolkit for interactive visualization of the resulting graphs. 

We built a MySQL database and populated it with all the papers from the 
graph drawing proceedings in the period 1994-02. The database can be queried 
online via a GUI. In addition to standard queries and responses, we can generate 
graphs that evolve through time. In particular, we generate citation graphs , co- 
authorship graphs, and topic graphs at varying time-granularity. Each of these 
types comes in two flavors, individual graphs and cumulative graphs. To study 

* This work is supported in part by the NSF under grant ACR-0222920. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 98-110, 2004. 
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the underlying graph processes, we also extract difference graphs which facilitate 
visual discovery of new trends and patterns. The evolution of the underlying 
graphs and the difference graphs can be studied via 2D and 3D animations. 

2 Related Work 

The visualization techniques for graphs that evolve through time are related to 
previous work in dynamic graph visualization. A number of papers use modifi- 
cations of static graph drawing algorithms [4,13,18]. Incremental graph drawing 
is used in DynaDag [17]. Bayesian decision theory, together with a force-directed 
method are used in [3] to model dynamic graphs. Sequences of graphs are consid- 
ered in [7] in order to create smoother transitions. Special classes of graphs such 
as trees, series-parallel graphs and sf-graphs have also been studied in dynamic 
models [5,15]. 

More recently, several papers consider larger graphs that evolve through time. 
In [2] a system for visualizing network evolution is presented, in which each mod- 
ification is shown in a separate layer of 3D representation with vertices common 
to two layers represented as columns connecting the layers. Along these lines, 
Collberg et al [6] describe a graph-based system for visualization of software 
evolution, which uses a modification a force-directed algorithm for visualization 
of large graphs [11]. Using a preliminary version of our system we analyzed 
the ACM digital library database, focusing on collaboration graphs and cate- 
gory graphs but the lack of citation data prevented us from extracting citation 
graphs [8]. 

3 The Relational Database 

We gathered data about the graph drawing community, more specifically the 
nine years of published proceedings from the International Symposium on Graph 
Drawing. This data came from three sources: an XML file with information from 
the DBLP Computer Science Bibliography, a GML file used in the 2001 graph 
drawing contest [1], and the proceedings themselves. We designed a schema that 
maps the data to all necessary relations for creation of collaboration, topic, and 
citations graphs; see Fig. 1. The database schema consists of seven tables stor- 
ing information about articles, authors, proceedings, and publishers. Rectangles 
represent entities, circles represent attributes, and diamonds represent binary 
relationships. The primary key for each entity is underlined. 

We used MySQL version 4.0.12, a powerful open-source database system, 
to implement this schema. The XML file obtained from DBLP contains biblio- 
graphical information for every paper published in the GD proceedings. It does 
not, however, contain citation information. We built an XML parser to extract 
the relevant bibliographical data from DBLP. The citation data for the period 
1994-00 was gathered from the the 2001 graph drawing contest GML file. Cita- 
tion data for 2001 and 2002, was manually entered directly from the proceedings. 
We intend to update the database with new entries from the latest proceedings 



100 



C. Erten et al. 




Fig. 1. The bibliographical database schema in Extended Entity- Relationship format. 



of the symposium on graph drawing, on an yearly basis. The database can be 
accessed and queried online at http://graphael.cs.arizona.edu. 



4 Graphical User Interface 

To interact with the database we created a graphical user interface; see Fig. 2. 
The GUI, written in Java, is available as both a web-based Java applet and a 
downloadable Java application. The applet version makes use of a PHP script as 
a servlet to maintain the security of the database server. The GUI allows users 
to query general dataset statistics, search for specific authors and papers, and 
query citation data by referencer and referencee. In addition, users can generate 
collaboration, topic, and citation graphs on the fly in the form of GML files with 
varying time-granularity. The applet opens a new browser window displaying 
the GML file, which can be saved locally. All of these queries can be limited in 
scope by date and the amount of results requested. 

The top panel of the GUI contains buttons that allow the user to change the 
querying parameters. The left panel of the GUI contains buttons representing the 
different types of results and GML files that can be obtained. The ’Go’ button, 
activates a servlet that will then query the database located on the server. The 
server returns the results to the servlet which then hands them to the applet 
where they are displayed. The following queries are supported by the GUI: 

— Database Statistics: lists data statistics, including number of articles and 
authors; 

— General Query: allows for search by author name, or title words; 

— Co-authorship GML: creates a co-authorship graph, for the given parame- 
ters; 

— Referencer Search: finds all papers that reference a particular paper; 



GraphAEL: Graph Animations with Evolving Layouts 101 




Fig. 2. The Graphical User Interface, shown here displaying a GML file. 



— Referencee Search: finds all papers that are referenced from a particular 
paper; 

— Citation GML: creates a citation graph, for the given parameters; 

— Topic Graph GML: creates a topic graph, for the given parameters; 

— Top X Lists: lists the most productive authors, most collaborative authors, 
etc. 



5 Interactive Visualization of Evolving Graphs 

GraphAEL calculates the layout for evolving graphs quickly. The underlying graph 
drawing algorithm is a modification of the GRIP algorithm [11,12]. Given a graph 
G = ( V , E) , GRIP computes a hierarchical sequence of vertex filtrations given by 
a sequence of m subsets of V. Let Vi, Vj be two filtrations such that i < j, then 
Vi C Vj and V m = V. The graphs defined by this filtration sequence are laid 
out in order, with the layout of the graph at each level i providing a basis to 
guide the layout of the graph at level i + 1. In this way large-scale structures of 
the graph are quickly captured by the filtration levels with low indices, and the 
small-scale details are refined in the layout of the levels with high indices. 

The layout of each filtration level is calculated with the force-directed place- 
ment method of Kamada-Kawai [14]. For a vertex v in G, the displacement of v 
is calculated by: 
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F kk (v) 



E 

u£Ni(v) 



||pos[u] — pos[u] || 2 
distc{u,v ) 2 • edgeLength 2 



(pos[u] — pos[u]) 



This equation is convenient because it calculates the displacement on each 
vertex based only upon its physical and graph distances from other vertices in the 
graph. Since in general, vertices in a particular filtration level are not adjacent to 
any other vertex in the level, but are instead connected to other vertices in the 
higher levels through a sequence of edges, using graph distances is a reasonable 
approach. Only in the last level of of filtration does the adjacency make sense, 
since the last level is the same as the initial graph. Thus at this level, it is possible 
to use force directed placement methods which depend on a notion of adjacency. 
The Fruchterman- Reingold method [10] for calculating the displacement of a 
vertex v, F fr (v) = F ayFR + F r ^p R , is used at the last filtration level, where the 
attractive and repulsive force vectors are given by: 



' a,FR 



E 

u£Adj(v) 



||pos[u] — pos[v ] 1 1' 
edgeLength 2 



( pos[u ] — pos[u]) 



Fr,FR 



E edgeLength 2 

— n np (p° s M - p° s M) 

ueNi ( v ) \\pos[u] - pos[v}\\ 



5.1 GraphAEL’s Layout Algorithm 

While GRIP does a decent job of laying out large static graphs, there is no 
appropriate mechanism to assign the vertices and edges of a graph attributes, 
such as weights, which might affect the layout. This is a major drawback in 
visualizing certain relations and even more so in visualizing evolving graphs. For 
instance, in visualizing the GD citation graph, it is useful to have a notion of 
both the weight of a vertex, (which might represent the number of times a paper 
is cited) , and a temporal notion (which might represent the year the paper was 
written). In contrast, GraphAEL supports the layout of graphs with node-weights 
and edge-weights, as well as the notion of a timeslice which is used to visualize 
graphs with a temporal component (evolving graphs) . 

GraphAEL tries to place heavy nodes well away from each other and to place 
vertices connected by heavy edges, closer to each other. Appropriate modifica- 
tions to the force-directed algorithm were made to accommodate these character- 
istics. Each edge in a graph is given an ideal length that it will attempt to attain, 
although this is not always possible. An edge e of weight w e connecting vertices 
u,v of weight w u ,w v respectively is given an ideal length of y/w u ■ w v /w e . 

Since the Kamada-Kawai method is used to determine parts of the layout, 
it is necessary to come up with a modified graph distance for weighted graphs. 
Because of the computational and space requirements of calculating the effects 
of all paths between two vertices, or of computing the shortest weighted path 
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between them, GraphAEL uses an approximation. Let pi,p 2 ,... ,Pn be the se- 
quence of vertices in the shortest unweighted path in G connecting two vertices, 
u and v. The modified Kamada-Kawai vector is given by: 

y- f 2 1| pos[u] -pos[v\ || 2 

^ \optDistG(u,v) 2 ■ edgeLength 2 + ||pos[u] — pos[u]|| 2 

where optDistc(u,v) is defined by, 



— 1 ) (pos[w] — pos[u]), 



optDistc{u,v) = V'' ^ P ' ' 

To achieve an aesthetically pleasing layout of the graph, it is also necessary to 
modify the Fruchterman-Reingold vectors. As in GRIP, the Kamada-Kawai cal- 
culation results in a good approximate layout so that the Fruchterman-Reingold 
calculations can quickly ’’tidy up” the layout. The modifications needed to sup- 
port weighted graphs with the ideal edge lengths described earlier are simple: 
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5.2 Timeslices and Animation 

To visualize a series of graphs, embodying the evolution of a set of relationships 
over time, we associate another attribute, called a timeslice, with each vertex. 
The timeslice of a vertex is simply a label associated with a vertex. We use the 
timeslice attribute to partition the vertices of a graph into groups by time. Sev- 
eral simple modifications to the layout algorithm are needed to accommodate 
timeslice information. Repulsive forces should only exist between vertices in the 
same timeslice, and thus the optimal distance between vertices in different times- 
lices which are connected by an edge is zero. In the Kamada-Kawai vectors, the 
only alteration required is that the the function optDistc{u,v) be redefined so 
that for two vertices it, v with timeslice indices of t u and t v respectively: 



optDistc(u,v ) = S tu t v ■ “ 

w e 

where St u t v is 1 if t u = t v and 0 otherwise. The modifications needed for 
the Fruchterman-Reingold calculations are similar. Repulsive forces are simply 
eliminated between vertices in different timeslices, F r , w , t ,FR = dt u t v • F r:Wt pR 
while the attractive forces remain unchanged, F ajWit ,FR = F ajW: pp. 
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The timeslice information alone is not enough to nicely layout evolving 
graphs; it is also necessary to arrange edges between the timeslices so that the 
resulting layouts can be used for animation. Perhaps the most straightforward 
and common case is when one has a series of “snapshots” of a graph taken at 
some interval over a period of time. When laying out such a graph there are nor- 
mally two types of constraints which need to be met: Each timeslice should have 
a pleasing layout, and the layout of consecutive timeslices should be similar, that 
is, the mental map should be preserved. Another way of formulating this latter 
constraint is that vertices on one timeslice should tend toward their positions in 
adjacent timeslices. To meet these constraints the timeslices are combined into 
a single graph and edges are added between vertices with the same labels in 
adjacent timeslices. 

Since vertices in different timeslices have no repulsive forces between them 
(but the edges between them retain their attractive forces) these additional edges 
attract each vertex toward the vertices associated with it in adjacent timeslices. 
The trade-off between layout within a timeslice and mental map preservation can 
easily be adjusted by altering the weights of the inter-timeslice edges. Heavier 
weights lead to better mental map preservation and lighter weights improve the 
layout of each timeslice. For the visualization of the graphs in this paper the 
weight of an inter-timeslice edge connecting two vertices of weights w u and w v 
is given a weight of (w u + w v ) /2, causing heavier vertices to stay in the same 
position over multiple timeslices. 

One disadvantage of this method of adding edges is that although the move- 
ments of a vertex from one timeslice to the next are minimized, it is still possible 
that over a span of many timeslices vertices will drift far from their initial po- 
sitions. If this is an undesirable effect it can be prevented by adding additional 
edges between vertices in distant timeslices. A straight-forward approach is to 
create a clique of all the occurrences of each vertex over all timeslices. GraphAEL 
has both options implemented. 

Once the layout of the evolving graph has been computed, several visualiza- 
tion options can be used. A static view, capturing all timeslices can be displayed. 
Each timeslice can be restricted to its own 2D plane or can be drawn in 3D and 
the individual graphs arranged on top of each other. Edges connecting different 
timeslices can be shown or hidden. Alternatively, the evolution can be shown 
via an animation between the timeslices in 2D or 3D. The animation uses an 
interpolation between timeslices and fading in/out of appearing/disappearing 
vertices and edges. Edge and vertex coloring is used to convey additional infor- 
mation. For example, in the citation graphs we indicate the direction of an edge 
by gradually varying the color from light(source) to dark(target) and the vertex 
color indicates the relative age of a particular paper. 

Finally, GraphAEL can generate and display difference graphs. The difference 
graph between two adjacent timeslices captures the difference between the two 
underlying graphs. For example, when visualizing the topic graph we may miss 
significant growth in an area with a few papers if the corresponding vertices are 
relatively small. With this in mind, the difference graph captures the percentage 
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Fig. 3. Top left: Cumulative citation graph for the period 1994-02. Bottom Left: 
view of the area enclosed in the red rectangle from top left. Right: Labeled vertices in 
a “symmetry” cluster. Directed edges go from light /source to dark/target. The color 
of the nodes corresponds to the year it was published: the lighter the node, the older 
the paper. 



change between levels. We create a series of difference graphs and as it is an 
evolving graph we can use all the tools for evolving graphs. 

6 Interactive Visualization of the GD Literature 

In this section we consider three types of evolving graphs extracted from the 
MySQL database: citation graphs, topic graphs, and collaboration graphs. We 
also calculated some general statistics about the data and we include gnuplot 
charts and tables for the data in the Appendix. 



6.1 Citation Graphs 

The citation graph for a given time period is a simple vertex-weighted directed 
graph in which the vertices correspond to distinct articles. A directed edge con- 
nects two articles, with the article that cites as the source and the cited article 
as the target. The weight of a vertex is determined by the number of citations 
an article received, divided by the number of years since its publication. The 
citation graphs can reveal information about influential papers in the field, and 
their relation with the rest of the graph. Fig. 3 shows several views of the citation 
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Fig. 4. Individual topic graphs for the period 1994-02, restricted to the top 20 title 
words, shown in timeslices of 3 consecutive years each: 1994-96 on the bottom, 1997-99 
in the middle, and 2000-2002 on the top. On the left are all the labeled nodes in each 
timeslice with invisible edges. 



graph. The size of a vertex reflects its weight, i.e. a large vertex corresponds to 
an article that is cited many times relative to its age. The vertex color indicates 
the age of the article, the darker a vertex the more recent the corresponding ar- 
ticle is. The edge color changes from yellow to blue to indicate direction, where 
the blue end of the edge is the target vertex. 

6.2 Topic Graphs 

The topic graph for a given time period is a simple vertex-weighted and edge- 
weighted undirected graph in which vertices correspond to title words and edges 
are placed between title words that co-occur in research papers. The weight of 
a vertex in the topic graph is proportional to the number of papers that contain 
the corresponding word in their titles. Similarly, the edge weight is proportional 
to the number of papers in which the two corresponding words co-occur in the 
title. Topic graphs can reveal information about related topics, the concentration 



GraphAEL: Graph Animations with Evolving Layouts 107 




Fig. 5. The individual difference graphs capturing the change from 1994- to 1997-99 
(left) and 1997-99 to 2000-02 (right). Dark nodes indicate growth and light nodes 
indicate decline. Note the percentage increase in “large” and decline in “interactive”. 




Fig. 6. Top left: collaboration graph for the period 1994-97. Bottom Left: view 
of the area enclosed in the red rectangle from top left. Right: closer view with more 
details. 



of research on a specific topic and the trends as they evolve through time. Fig. 4 
contains several visualizations of the topic graph. The edges connecting same 
vertices in adjacent timeslices help with mental map preservation and are used 
to determine the vertex locations in the animation between timeslices. 

We also construct topic difference graphs in which the weight of a vertex is 
the percentage of its weight change between adjacent timeslices. The edge weight 
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Fig. 7. Screenshots from the animation of the cumulative collaboration graph in the 
period 1994-02. Each timeslice represents a one year period. A movie of the actual 
animation can be seen at the GraphAEL webpage. 



is proportional to the corresponding percent-change between two time periods. 
Together with the topic graphs, the topic-change graphs can be effectively used 
to visualize the evolution of GD research focus through time. Fig. 5 shows the 
topic-change graph corresponding to the topic graph in Fig. 4. 



6.3 Collaboration Graphs 

Collaboration graphs are simple undirected node-weighted and edge weighted 
graphs. Vertices represent unique authors and there is an edge between two ver- 
tices if the respective authors have collaborated on a research paper. The weight 
of a vertex is determined by the author’s collaborativeness and productivity. The 
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Table 1. Statistics for GD, ACM [8], and NCSTRL [16]. 



General 


GD- Value 


ACM-value 


NCSTRL- value 


Total papers 


413 


51503 


13169 


Total authors 


502 


81279 


11994 


Authors per paper 


2.54 


2.32 


2.22 


Papers per author 


2.09 


1.80 


2.55 


Collaborators per author 


3.74 


3.36 


3.59 


Percentage of giant component 


49 


49 


57.2 


Percentage of 2 nd component 


0.02 


0.11 


0.004 


Clustering Coefficient 


0.60 


0.62 


0.50 


Average Distance 


4.33 


9.26 


9.7 


Maximum Distance 


10 


30 


31 



weight of an edge represents the strength of the collaborative ties between two 
authors. Fig. 7 shows the evolution of the cumulative collaboration graph, i.e. , 
each timeslice adds to the previous timeslice. Once an interesting timeslice to be 
viewed is found it is easy to get a detailed visualization of that specific timeslice. 
For example, Fig. 6 shows the collaboration graph for the period 1994-97. For a 
more detailed view we zoom in one of the areas where there is a concentration 
of several well-connected components. 



6.4 Statistics from the Graph Drawing Literature 

We compared the data from the GD database, with the data from two computer 
science databases, the ACM database [8] and the NCSTRL database [16]. Table 1 
shows a summary of the overall statistics and they are examined in detail in the 
full version of the paper [9] . 
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Abstract. We propose a method for visualizing a set of related 
metabolic pathways using 2|D graph drawing. Interdependent, two- 
dimensional layouts of each pathway are stacked on top of each other 
so that biologists get a full picture of subtle and significant differences 
among the pathways. Layouts are determined by a global layout of the 
union of all pathway-representing graphs using a variant of the proven 
Sugiyama approach for layered graph drawing that allows edges to cross 
if they appear in different graphs. 

1 Introduction 

Metabolic pathways are subnetworks of the complete network of metabolic re- 
actions. They differ across organisms because different species may for example 
have developed different ways to synthesize a specific substance. We are inter- 
ested in visualizing several related metabolic pathways in such a way that the 
inherent differences can be explored by trained biologists in order to understand 
the evolutionary relationships among species. The structural characteristics of 
metabolic pathways make them particularly amenable to layered graph drawing 
methods [2,24,27]. We therefore propose to visualize sets of related pathways in 
2^ dimensions, i.e. to produce interdependent, two-dimensional, layered layouts 
for all pathways and stack them on top of each other so that the most similar 
pathways are adjacent. 

To realize such a design, two graph drawing issues need to be addressed. We 
have to determine a suitable ordering to reduce the variation between consecu- 
tive pathways and we have to deal with dependencies introduced by the many 
substances and reactions present in more than one pathway. An interesting con- 
sequence is a new type of crossing number in which the weight of a crossing may 
be different for each pair of edges. 
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This paper is organized as follows. In Sect. 2, we give some background 
on the type of networks considered and define the graph model on which we 
operate. After briefly reviewing related approaches to visualizing similar graphs, 
we specify our visualization design in Sect. 3. In Sect. 4 we address the ordering 
problem while in Sect. 5 we discuss a method to determine a global layout of 
the stacked pathways considering that edge crossings are less severe if the edges 
involved do not coexist in the same pathway. To demonstrate the utility of our 
method it is applied to typical real-world data in Sect. 6. 



2 Metabolic Pathways 

Metabolic reactions are transformations of chemical substances which occur in 
living beings and are usually catalyzed by enzymes. A reaction changes certain 
substances (reactants) into different ones (products). Metabolic reactions form 
large and complex networks as for example shown on the well-known Biochem- 
ical Pathways Poster part 1 [21]. A metabolic pathway is a subnetwork of the 
complete network of metabolic reactions (see Fig. 1). Such a pathway can be 
given by biochemical textbooks and databases such as KEGG [17] or be defined 
by functional boundaries such as the network between an initial and a final 
substance. 




Fig. 1 . A metabolic pathway 



Metabolic pathways differ across organisms. Studies suggest significant vari- 
ations even in most central pathways such as glycolysis [6] . Comparative analysis 
of pathways across species has several applications: 

— understanding the evolutional relationships between species. 

— development of species-specific drug targets (e.g. antibiotics). 

— identification of previously unknown parts of pathways in a species. 

From a formal point of view a metabolic pathway is a directed hyper-graph. 
The vertices represent the substances within a pathway, the hyper-edges rep- 
resent reactions. A hyper-edge representing a reaction connects substances and 
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is labeled with the enzyme(s) that catalyze the reaction. As hyper-graphs are 
not commonly used for simulation and visualization purposes, metabolic path- 
ways are often modeled by bipartite graphs, e.g. Petri-net representations of 
pathways [23]. Here the reactions themselves are vertices, and edges are binary 
relations connecting substances with reaction vertices. 

Three structural properties of metabolic pathways are particularly relevant 
for visualization. They typically are: 

1. small in size (often less than two dozen reactions) 

2. sparse (because most substances are involved in few reactions) 

3. directed and acyclic (due to a dominant direction of most reactions and a 

small number of cyclic pathways such as the citrate cycle) 

Several similarity measures have been introduced to compare pathways. They 
are characterized by the combination of structural information about metabolic 
networks with additional data such as sequence information [11], the enzyme 
classification hierarchy [29], or information about the hierarchical clustering of 
reactions into pathways [20]. All these similarity measures require additional 
information to the metabolic network, for example the genome sequence of the 
organisms. 

We suggest a more general measure for reaction or pathway similarity which 
only depends on the structure of the graph, i.e. on the presence or absence of 
vertices and edges. This facilitates comparison of metabolic pathways from dif- 
ferent sources (databases, experiments) even if no additional data is available or 
the pathway boundaries are user-defined. Note that in general our visualization 
method is independent of the similarity measure which is only used to compute 
the order of the stacking. 

2.1 Formalization 

For the purpose of this paper the distinction between vertices representing reac- 
tions and substances is not important. A metabolic pathway is therefore modeled 
by a directed graph G = (V, E ), where a vertex v £ V represents either a sub- 
stance or a reaction, and an edge (v, w) £ E indicates that substance v enters 
reaction w or that reaction v produces substance w. 

Our goal is to visualize a set {G± = (Vi,Ei),G 2 = ( Vi , E 2 ), ■ . . , G r (V r , E r )} 
of graphs that represent related pathways. Though the dissimilarity of two path- 
ways Gi = (' Vi,Ei ), Gj = ( Vj,Ej ) can be measured in various ways, the argu- 
ments outlined above suggest that the following measure is appropriate. 

Consider the union graph G = (V) E ), where V = (J[=i ^ an d E = Ul=i Ei, 
and a set of relevant elements P C (V LIE). The Hamming distance of two graphs 
Gi,Gj, 1 < i,j < r, is defined as the cardinality of the symmetric difference of 
relevant elements present in either graph, i.e. 

Sp(Gi,Gj) = |((V5 U Et) A (Vj U Ej)) n P)| . 

Thus, dissimilarity can be defined in terms of missing edges, vertices, or both, 
by choosing P accordingly. 



114 U. Brandes, T. Dwyer, and F. Schreiber 



3 Visualizing Similar Networks 

There are two common approaches to compare pathways in different species visu- 
ally. Either the combination of all pathways into one diagram or the production 
of a drawing for each species. The first method is used in many textbooks, on 
a famous poster [21] and in systems such as BioMiner [27] and BioPath [12]. In 
general, the drawings contain either multiple (parallel) reaction-edges or single 
ones which are color-coded depending on the occurrence of a reaction in a set of 
species. An example for the second approach is the visual interface of the KEGG 
database [17] where all enzymes found in the gene catalog of a specific species 
are marked in the reference pathway map in order to identify the species-specific 
pathways. To compare pathways in r species, r diagrams are needed. A dynamic 
visual comparison method producing a diagram for each species is presented 
in [25]. 

These solutions are restricted to the comparison of only a few pathways, 
because either (in the first approach) the readability of the diagram decreases or 
(in the second approach) the size of the picture increases dramatically with each 
new pathway. Furthermore, none of the above mentioned methods deal with the 
problem of computing an appropriate order of the pathways. 

3.1 Drawing Graphs in 2^D 

To allow users to compare several metabolic pathways, we want to draw them 
in different parallel planes, but with interdependent layouts. We call this type 
of representation a 2^D drawing because the third dimension is used in a way 
fundamentally different from the other two. Note that, traditionally, the axes 
are interchangeable in 3D graph drawing. 

The idea of treating the third dimension as an independent channel conveying 
a different kind of information [30] has been applied in numerous settings dealing 
with different types of network data. For instances, the data can be a single 
graph with vertex attributes that determine the third dimension [19,5], a graph 
together with a hierarchical clustering [9] or graphs that evolve over time [4,7]. 
The latter example relates 2^D graph drawing to dynamic graph drawing and 
graph animation, since the difference in layout between consecutive states of the 
graph should be small. Another interesting example is the dependence graph of 
spreadsheet cells [26] , which has a natural 2D layout that can be spread in three 
dimensions to indicate the data flow. 

The case considered in this paper is somewhat different from those above. 
If we consider our pathways as subgraphs of the union graph, the data are 
characterized by the fact that 

— there is a set of subgraphs with no given ordering, 

— vertices and edges are likely to appear in more than one subgraph, and 

— there are no edges to be drawn between different subgraphs. 

Moreover, we aim at a representation that consists of a stacked set of two- 
dimensional (layered) drawings for each subgraph where a vertex has the same 
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2D-coordinate in each drawing in which it appears (so that it can be represented 
by a straight column) . 

Our approach can therefore be viewed as a generalization of both the 
Sugiyama method [28] and parallel coordinates [15]. 



4 Stacking Order 

Let {Gi = (Vi, -Ei), G<i = (V 2 , E 2 ), . . . , G r (V r , E r )} be a set of graphs, and G = 
(V, E) their union graph. We want to order these graphs so that those which are 
similar with respect to Hamming distance are close to each other. Variations of 
this problem arise in many applications, and two of them are especially relevant 
in our context. Let P C (V U E) be a set of relevant elements. 

Problem 1 (MIN SUM ORDERING) . Find a permutation er = (ay, . . . , ay) such 
that 

r— 1 



^MG CTi ,G CTi+1 ) 



i= 1 



is minimum. 



In the context of data transmission, this problem is also known as DOP 
(data ordering problem) [22] . A less restricted version (with an arbitrary distance 
matrix) is considered in [18], where the goal is to order parallel coordinates. 

Theorem 1 . MIN SUM ORDERING is MV-hard. 

Proof. Straightforward reduction from HAMMING DISTANCE TSP [1], 

For a permutation er = (ay, ... , ay) we define the lifetime of an element p £ P 
to be A a (jp) = {1 < i < r : p £ G ai }. An element p £ P is called persistent , 
if its lifetime spans the entire interval {1, . . . , r}, and transient otherwise. The 
number of appearances and disappearances of p £ P are defined by a a {p) = 
|{1 < i < r : p £ G ai \ G^J | and d a (p) = |{1 < i < r : p £ G ai \ G ai+1 }\. 
Note that persistent elements make no appearances or disappearances. 

Corollary 1 . MIN SUM ORDERING is equivalent to minimizing 

(<v(p) + d a (jp)) . 

peP 

A related alternative objective is therefore to minimize the maximum number 
of times an element appears or disappears in an ordering. 

Problem 2 (MIN INTERVAL ORDERING). Find a permutation a = 
(ay, ... , oy) such that 

max{a ff (p ) , d a (p ) } 
peP 

is minimum. 
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This problem is a generalization of the consecutive ones property, since, if 
p £ P is transient, max{a CT (p), d a (p)} is the number of lifetime intervals (and 
zero otherwise). We have the following complexity status. 

Theorem 2. MIN INTERVAL ORDERING is MV-hard , but it can be deter- 
mined in linear time whether there is an ordering such that each relevant element 
appears or disappears at most once. 

Proof. See [14]. Since the restricted problem corresponds exactly to the consec- 
utive ones property, it is linear-time solvable using PQ-trees [3]. 

Since both problems arise in many contexts, a variety of algorithms is avail- 
able to determine an ordering. For MIN SUM ORDERING, for instance, heuris- 
tics for the TSP are easily adapted to yield good orderings. Other alternatives 
include a simple greedy heuristic that sucessively inserts a new element where it 
causes the smallest increase of the objective (this method is claimed to perform 
well for instances in data transmission [22]) and one-dimensional projections of 
the distance matrix obtained by principal component analysis. 



5 Global Layout 

To maximize the similarity between visualizations of related graphs, we compute 
a layout only for the union graph. The drawing of vertices and edges thus re- 
mains unchanged throughout their lifetime. Since graphs representing metabolic 
pathways tend to be sparse and acyclic, the Sugiyama framework for layered 
graph layout [10,28] is widely used to visualize individual pathways [2,24,27]. 

While straightforward application of any of these methods to the union graph 
is possible, better results can be obtained by taking into account that the indi- 
vidual graphs are to be viewed in separate, parallel planes. To avoid confusion 
with layers in the Sugiyama approach, we refer to these as strata. 



5.1 A New Crossing Minimization Problem 

Our main modification with respect to standard variants of the Sugiyama frame- 
work is motivated by the observation that crossings of edges not present in the 
same stratum can be resolved by 3D rotation or stereo depth perception as 
shown in Fig. 2. Hence, when computing the global layout, we need not account 
for such crossings the same way as crossings between edges with overlapping 
lifetime intervals. 

A common approach for crossing reduction in layered layouts is based on a 
layer-by-layer sweep, in which the ordering of vertices in a layer Lq is fixed and 
vertices in an adjacent layer Li are permuted to reduce the number of crossings. 
Exact and heuristic methods for this one-sided crossing minimization problem 
are based on the crossing matrix (c^u^^Ln in which an entry c uv corresponds 
to the number of crossings between the two layers caused by edges incident to u 
and v, if u is placed to the left of v. 
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(a) top view 



(b) rotated view 



Fig. 2. Crossings between edges in different strata can be resolved by 3D rotation. 



We define a strata-aware crossing matrix by multiplying the contribution 
of a pair of edges to an entry c uv with the number of times that these edges 
are present in a common stratum. Note that this results in a weighted crossing 
matrix in which crossings are counted individually for each pair of edges, and 
that this weighting scheme is different from, say, assigning weights to edges and 
multiplying these if edges cross. 

Obviously, our weighted crossing minimization variant is at least as difficult 
as standard crossing minimization. 



5.2 Implementation 

Since most crossing reduction methods based on the crossing matrix are oblivious 
to the definition of its entries, they can be applied in our case as well. 

We adapted the open-source program dot 1 which uses a median heuristic 
coupled with an adjacent-exchange post-processing step [13]. New permutations 
generated by these heuristics are rejected if they lead to an increase in edge 
crossings according to our modified crossing matrix. 

Since this method does not consider crossings until after a permutation is gen- 
erated it was felt that it might not be readily compatible with the new definition 
of c uv . As an alternative we also implemented the Integer Linear Programming 
(ILP) approach suggested in [16]. The ILP method directly uses c uv — c vu as the 
coefficients of the variables of the cost function to find an exact solution. 

In our experiments we found that the median heuristic outperformed the 
ILP approach, because it tends to achieve a reasonable global solution which 
the subsequent adjacent-exchange is able to improve according to our modified 
definition of c uv . Moreover, it is significantly faster than the branch and cut 
algorithm for solving the ILP. 

For 2|D graph drawing, the horizontal coordinate assignment phase should 
be adapted as well. When routing the edges in 2^D we can allow dummy vertices 
on different strata to overlap. In dot’s implementation, an auxiliary graph is 
created in which edges of an arbitrary minimum length are inserted between 

1 Available at http://www.graphviz.org/. 
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adjacent vertices (and dummy vertices) in each layer to keep them separated. 
We therefore set the length of auxiliary edges between adjacent dummy vertices 
that do not coexist on the same stratum to zero, thus allowing such vertices, 
and hence edges, to overlap. This modification led to a significant improvement 
in aspect ratio of the final layout (approx. 40% in our densest test cases). 







Fig. 3. Layouts of individual pathways obtained from a union graph layout. Appearing 
edges are shown dashed, disappearing edges dotted. 



6 Application Example 

The utility of our approach is demonstrated on pathways extracted from the 
KEGG database [17]. The data consists of parts of the glycolysis and fruc- 
tose/mannose metabolism pathways in seven organisms that show significant 
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Table 1. Hamming-distance matrix for seven pathways 



(a) (b) (c) (d) (e) (f) (g) 



(a) 


0 


21 


23 


21 


43 


40 


56 


(b) 


21 


0 


22 


20 


48 


39 


53 


(c) 


23 


22 


0 


10 


38 


35 


45 


(d) 


21 


20 


10 
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34 


31 


41 


(e) 


43 


48 


38 


34 
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15 


31 


(f) 


40 


39 


35 


31 


15 


0 


16 


(g) 


56 


53 


45 


41 


31 


16 


0 



Haemophilus influenzae 
Escherichia coli CFT073 
Streptococcus pyogenes 
Bacillus subtilis 
Arabidopsis thaliana 
Drosophila melanogaster 
Homo sapiens 



differences. Table 1 gives Hamming distances between these pathways with all 
elements P = ( V U E) considered relevant. The order in which the organisms 
are listed is optimal with respect to MIN SUM ORDERING and was computed 
by enumeration. 

Using our adapted version of the dot program described in the previous sec- 
tion, a layout of the union graph of these seven pathways was computed. The 
resulting individual layouts are shown in Fig. 3. 

The 2^D representations shown in Figs. 4 and 5 have been created with the 
WilmaScope 3D graph visualization system [8] . Edge appearances and disappear- 
ances are color-highlighted using green and red. By moving a semi-transparent 
plane through the image, users can navigate forward and backward in the simi- 
larity-ordered sequence of pathways. 
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Fig. 4. 2|D drawing of seven related pathways (parallel projection) 
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(a) A perspect ive project ion (b) A screenshot- showing the cross-sect ion viewer for 

interactive exploration 



Fig. 5. Perspective projection with cross-section viewer for interactive exploration 



7 Discussion 

We have presented an approach to visualize sets of related metabolic pathways. 
It should be noted that the usefulness of the stacking order is dependent on the 
quality of the data, which is frequently poor. However, the biologically mean- 
ingless order thus produced may in turn help discover data inconsistencies. 

Straightforward extensions of our approach include varying distances be- 
tween strata to indicate the actual dissimilarity and varying vertex thickness 
to represent numerical attributes like volume. A more challenging task is the 
computation of interdependent layouts for navigation along a phylogenetic tree 
rather than our one-dimensional ordering of pathways. 
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Abstract. The Unified Modeling Language (UML) has become the soft- 
ware industry’s standard notation for representing software architecture 
and design models. UML diagrams play an important role in the engi- 
neering and re-engineering processes of software systems. Of particular 
interest from the Graph Drawer’s perspective are UML class diagrams 
whose purpose is to display class hierarchies (generalizations), associa- 
tions, aggregations, and compositions in one picture. The combination 
of hierarchical and non-hierarchical relations poses a special challenge 
to a graph layout tool. We present an implementation of our technology 
within well-known modelling tools. 



1 Introduction 

The Unified Modeling Language (UML) by Booch, Rumbaugh and Jacobson 
(see [1]) provides a mainly graphical notation to represent the artifacts of a soft- 
ware system. The most important UML diagram type for software architects is 
the UML class diagram consisting of classes represented by rectangular regions 
containing the class name, attributes and operations of the class, and differ- 
ent kinds of relationships between classes that are represented as lines. Since 
these diagrams are a means of communication between customers, developers, 
and others involved in the software engineering and re-engineering process, it is 
critical that the diagrams present information clearly. An appropriate layout of 
these diagrams can assist in achieving this goal (see [7]). 

We have developed an approach for automatically laying out UML class di- 
agrams in an orthogonal fashion (see [6]). Our approach distinguishes between 
two kinds of relationships: generalizations representing inheritance in class hi- 
erarchies and associations including aggregations and compositions. Inheritance 
hierarchies are emphasized in several ways: 

— generalizations belonging to the same hierarchy are drawn following the same 
direction, 

— nesting of different hierarchies within each other is avoided, 
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— generalizations leading to the same super class join prior to reaching the 
super class, 

— highlight the various class hierarchies by different colors, 

— and highlight the generalizations by color. 

For a clear visualization of the specific combination of hierarchical and non- 
hierarchical components, we put special emphasis on meeting a balanced mixture 
of the above criteria plus the following aesthetic criteria: crossing minimization, 
bend minimization, orthogonality, and horizontal labels. 

The layout functionalities are provided as plug-ins to achieve a tight integra- 
tion into existing tools (in this case, software development tools). This allows the 
user to work within a familiar environment without concerning about an extra 
user interface for graph layout and the data exchange between the different tools. 
Following this strategy, we expect an increasing user acceptance for automatic 
layout algorithms. 



2 The Plug-in Philosophy 

Modern development tools typically come with a graphical software modelling in- 
terface. In order to support developers and designers in the software development 
process, the automatic layout component has to be accessible within their de- 
velopment environment. Therefore we integrated the graph drawing technology 
as plug-ins into existing CASE tools. The layout component supports the soft- 
ware engineer to manage the software projects by arranging the different model 
views from activity diagrams to class diagrams. This approach combines the core 
competencies of both the CASE tool provider and the layout tool provider. 

This demonstration presents a seamless integration of our technology into 
the following two development tools ( applications for short in the following): 

— Gentleware Poseidon for UML ([5]), a low-cost UML CASE tool. It evolved 
from the Open Source project ArgoUML and has a large number of instal- 
lations (currently over 400.000). 

— Borland Together ControlCenter ([2]), an enterprise development platform 
that combines application design, development, and deployment. 

After installing the plug-in, a new menu called GoVisual is available within 
the application. The menu comes along with new graph layout tool buttons 
(see Fig. 1) that give access to various diagram layout algorithms, including the 
orthogonal UML layout algorithm as described in [6]. The user is enabled to 
apply a layout algorithm to a currently active diagram. Moreover, the user may 
adjust options settings of the layout algorithms to meet his aesthetic preferences. 

The orthogonal UML layout algorithm is part of the large GoVisual frame- 
work of layout algorithms and data structures for the automatic layout of di- 
agrams. GoVisual is an object-oriented C++ class library. Our plug-ins work 
by accessing the GoVisual API. As both applications are pure Java, we use 
our API’s Java Native Interface to access the library. The plug-in core and the 
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user interface components are written in Java using the application’s plug-in 
interfaces. 

To make the technology accessible to the end-user, a GoVisual menu is in- 
stalled in the main menu bar of the applications. In addition, there are GoVisual 
layout buttons inserted into the tool bars (see Fig. 1) to allow direct access to 
the new layout functionality. 




» File Edit View 



ojo 1 15 Q a ■ 



Create ( 




(a) ControlCenter 



(b) Poseidon 



Fig. 1 . Easy access through GoVisual menu and toolbar buttons. 



Since layout algorithm parameters are usually hard to understand for users 
that do not have an appropriate graph drawing background, we decided to use 
a simple option interface that prevents the software-developer from being con- 
fronted with these parameters. The options are presented in a user-friendly way, 
i.e. , the parameters are hidden behind an easy-access interface. The quality of 
the crossing minimization procedure, for example, can be selected using a slider 
bar, though, behind the scenes, the selection is translated into a distinguished 
graph algorithms setting (see Fig. 2). 



3 Automatic Layout vs. Built-in Techniques 

Automatic layout functionalities should give the user a clear and concise view 
of the software model. It integrates into the user’s familiar environment as de- 
scribed in Sect. 2, and it offers superior layout capabilities compared to the 
layout functions given by the integration platform. Figure 3 shows a sample 
UML class diagram. The layout in 3(a) has been created with the automatic 
layout functionality of ControlCenter itself. It contains 13 crossings, including 
two crossings of generalizations and nine crossings between generalizations and 
associations. The diagram is difficult to read and does not reveal enough infor- 
mation to the user in order to easily understand the structure of the software 
project. It is especially difficult to identify the inheritance hierarchies. The lay- 
out given in Figure 3(b) has been computed by the GoVisual plug-in. It shows 
only one crossing of two associations. Moreover, the project’s structure becomes 
visible at glance. Especially, the three inheritance hierarchies are easy to recog- 
nize, since the generalizations within the same class hierarchy are drawn in the 
same direction and the three class hierarchies are highlighted by different colors. 
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(a) ControlCenter 




(b) Poseidon 



Fig. 2. Go Visual option pages. 





(a) Automatic layout 
by Together Control- 
Center. 



(b) Automatic layout 
by GoVisual. 



Fig. 3. A Sample UML class diagram automatically laid out by Together Control- 
Center (a) and by GoVisual orthogonal UML layout (b). The GoVisual layout also 
automatically highlights the three inheritance hierarchies using different colors. 



Our techniques clearly outperform the built-in automatic layout algorithms 
(see Figs. 3, 4 and 5). The application’s built-in standard layout algorithms 
may even hide the diagram structure to an extent where it is nearly unreadable 
(see Fig. 4). The Gentleware Poseidon for UML application does not provide an 
advanced automatic layout capability (see Fig. 5 (a)). 
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(a) Automatic layout by To- 
gether ControlCenter. 




Fig. 4. A Sample UML class diagram with a tree structure automatically laid out by 
Together ControlCenter (a) and by GoVisual Tree Layout (b). 




(a) Random layout by Poseidon. (b) Automatic layout by GoVisual. 

Fig. 5. A Sample UML class diagram with no layout given by Poseidon for UML (a) 
and by GoVisual UML Orthogonal Layout (b). 

Some of the GoVisual layout styles are specialized for a certain type of dia- 
gram, e.g. the UML class diagram layout styles, others are variations of standard 
algorithms that can be applied to all kinds of available (UML) diagrams. 

The GoVisual plug-ins provide the following layout styles: 

— UML Orthogonal Layout: The unique GoVisual layout technique (see [6]). 

— UML Hierarchical Layout: A hierarchical layout especially suited for UML 
Class Diagrams (based on [8]). 

— Orthogonal Layout having a focus on the minimization of crossings between 
edges and the minimization of the number of bends of the edges (based 
on [9]). 

— Symmetric Layout using an energy-based layout technique (based on [4]). 

— Tree Layout for the visualization of non-circular structures (based on [3]). 
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All layout styles provide a set of drawing and optimization algorithms that 
can be combined to a layout algorithm that provides the best results for the 
user within the chosen layout style. Furthermore a variant of parameters can be 
manipulated within each style. 

More information about the plug-ins and GoVisual software in general can 
be found at http : / /www . oreas . com. 
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Abstract. We show that an outerplanar graph G with n vertices and 
degree d admits a planar straight-line grid drawing with area 0(dn 1A8 ) 
in 0(n ) time. This implies that if d = o(n 0 ' 52 ), then G can be drawn in 
this fashion in o(n 2 ) area. 



1 Introduction 

A drawing A of a graph is a straight-line drawing, if each edge is drawn as a single 
line-segment. A is a grid drawing if all the vertices have integer coordinates, r 
is a planar drawing, if edges do not intersect each other. Here, we concentrate on 
grid drawings. So, we assume that the plane is covered by an infinite rectangular 
grid consisting of horizontal and vertical channels. Let A be a grid drawing. Let R 
be the smallest rectangle with sides parallel to the A-and Y-axes, respectively, 
that covers r completely. The width ( height ) of r is equal to 1+ width of R 
(1+height of R). The area of T is equal to (1+width of i?)-(l+height of R), 
which is equal to the number of grid points contained within R. The degree of a 
graph is equal to the maximum number of edges incident on a vertex. 

There has been little work done on the area-requirement of planar straight- 
line grid drawings of outerplanar graphs. Currently, the best known upper bound 
on the area of such a drawing of an outerplanar graph with n vertices is 0(n 2 ), 
which is the same as for general planar graphs [3,8]. 

In this paper, we show that an outerplanar graph G with n vertices and 
degree d admits a planar straight-line grid drawing with area 0(dn 1A8 ) in 0(n) 
time. This implies that if d = o(u 0 52 ), then G can be drawn in this fashion in 
o(n 2 ) area. 

In Section 4, we give a brief description of our drawing algorithm (for more 
details, see [6]). It is based on a tree-drawing algorithm of [2], and uses the fact 
that the dual of a maximal outerplanar graph is a tree. 

2 Related Results 

Let G be an outerplanar graph with n vertices. [1] shows that G admits a planar 
polyline drawing as well as a visibility representation with O(nlogn) area. [7] 

* Research supported by NSF CAREER Award IIS-9985136, NSF CISE Research 
Infrastructure Award No. 0101244, and Mark Diamond Research Grant No. 13- 
Summer-2003 from GSA of The State University of New York. 
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shows that G admits a planar polyline drawing with 0(n) area, if G has degree 
at most 4. The technique of [7] can be easily extended to construct a planar 
polyline drawing of G with O(cPn) area, if G has degree d [1], Also, in 3D, G 
admits a crossings-free straight-line grid drawing with 0(n) volume [4,5]. 

3 Preliminaries 

We denote by |G|, the number of vertices (nodes) in a graph (tree) G. An ordered 
tree is one with a pre-specified counterclockwise ordering of edges incident on 
each node. A path P = vovi . . . v q is a, root-to-leaf path of a binary ordered tree 
T, if vo is the root of T, and v q is a leaf of T. A left (right) subtree of P is a 
subtree of T rooted at the left (right) child c of a node of P, such that c does 
not belong to P. The size of a subtree of T is equal to the number of nodes in it. 
The following lemma, which follows directly from Lemma A.l of [2], defines the 
concept of a spine , which is a special kind of a root-to-leaf path (see also [6]). 

Lemma 1 (Lemma A.l of [2]). Let p = 0.48. Given any binary ordered tree 
T with n nodes, there exists a root-to-leaf path P, called spine, such that for any 
left subtree a and right subtree (3 of P, |a| p + |/3| p < (1 — 5)n p , for some constant 
S > 0. Also, assuming that we have already pre-computed the size of the subtree 
rooted at each node v of T and stored it in v, we can compute P in 0(|P|) time. 

Let G be a maximal outerplanar graph, i.e., an outerplanar graph to which 
no edge can be added without destroying its outerplanarity. It is easy to see that 
each internal face of G is a triangle. The dual tree Tq of G is defined as follows: 

— there is a one-to-one correspondence between the nodes of Tq and the inter- 
nal faces of G, and 

— there is an edge e = {u, v ) in Tq if and only if the faces of G corresponding 
to u and v share an edge e' on their boundaries, e and e! are duals of each 
other. 

(Figure 1(b) shows the dual tree of the outerplanar graph of Figure 1(a).) 
Let P = vqVi . . . v q be a path of Tq- Let H be the subgraph of G correspond- 
ing to P. A beam drawing of H is shown in Figure 2, where the vertices of H are 
placed on two horizontal channels, and the faces of H are drawn as triangles. 

A line-segment with end-points a and b is a flat line-segment if a and b 
are grid points, and either belong to the same horizontal channel, or belong to 
adjacent horizontal channels. Let B be a flat line-segment with end-points a and 
b, such that b is at least one unit to the right of a. Let G be an outerplanar 
graph with two distinguished adjacent vertices u and v, such that the edge (it, v) 
is on the external face of G; u and v are called the poles of G. Let D be a planar 
straight-line drawing of G. D is a feasible drawing of G with base B if: 

— the two poles of G are mapped to a and b each, 

— each non-pole vertex of G is placed at least one unit above the lower of a 
and b, at least one unit to the right of a, and at least one unit to left of b. 

Throughout the rest of this paper, for simplicity, by the term outerplanar 
graph, we will mean a maximal outerplanar graph. 
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Fig. 1 . (a) An outerplanar graph G. Here, H, K\. K2, K[, K' 2 , L 1, L2, Mi, N\, N2, 
N3, N4, Qei, Qe 2, Qe 4, and Q e 5 are subgraphs of G, and are themselves outerplanar 
graphs, (b) The dual tree Tg of G. The edges of Tg are shown with dark lines. Note 
that vqVi . . . V13 is a spine of Tg- 



(b) 

(a) 

Fig. 2 . (a) A path P and its corresponding graph H. (b) A beam drawing of H. 




4 Outerplanar Graph Drawing Algorithm 

The drawing algorithm, which we call Algorithm OpDraui, is recursive in nature. 
In each recursive step, it takes as inputs an outerplanar graph G with pre- 
specified poles, and a long-enough flat line-segment B, and constructs a feasible 
drawing D of G with base B. D is constructed by constructing a drawing M of 
the subgraph Z corresponding to a spine of Tq, splitting G into several smaller 
outerplanar graphs after removing Z and some other vertices from it, construct- 
ing feasible drawings of these smaller outerplanar graphs, and then combining 
their drawings with M to obtain D. 
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Fig. 3. The drawing of the outerplanar graph of Figure 1(a) constructed by Algorithm 
OpDraw, in the case, where v is one unit above u. 



Let u and v be the poles of G. Let Tq be the dual tree of G. Let r be the 
node of Tq that corresponds to the internal face F of G that contains the edge 
(u,v). Convert Tq into an ordered tree by making r its root, and assigning the 
edges incident on each node o the same counterclockwise order as the one that 
their dual edges have in the face of G corresponding to o. Note that Tq is a 
binary tree because each internal face of G is a triangle. 

Let P = V 0 V 1 V 2 ■ ■ ■ v q be a spine of Tq, where Vq = r. In general, we can 
interpret the structure of G with respect to P as follows (see Figure 1): Let u, v, w 
be the vertices belonging to face F. Assume that the edge (vq, i>i) of P is the dual 
of the edge (v,w) (the case, where (vo,vi) is the dual of ( u,w ), is symmetrical). 
Let u, w = Co, Ci, . . . , Cm = Cq , c^, . . . , d s be the clockwise order of the neighbors 
of v, where m is the integer such that for each i (1 < i < m), the face Ci-\CiV 
corresponds to the spine node Vi, and for each i (1 < i < s), the face c' i _ l c J i v 
corresponds to a non-spine node of Tq (In Figure 1, m = 3, and s = 2). Let 
iJ, Ki , and K[ be the maximal biconnected subgraphs of G that contains edges 
(■ u,w ), (cj_i,Cj), and (c' i _ 1 ,c' i ), respectively, but not the faces uvw, i;Cj_iCj, and 
vc' i _ 1 c' i , respectively. Let ao, ay, . . . , ah, cth+i, &t be the vertices of K m different 
from c m _i and c m such that ao, ai, . . . , is the clockwise order of the neighbors 
of c m _i, and ah, ah+i, &h+ 2 , ott is the clockwise order of the neighbors of c m . 
For example, in Figure 1, h = 4, and t = 5. Let j be the index such that the 
dual of edge (aj-i,aj) belongs to P (if no such j exists, then we can do the 
following: if K m consists of only one internal face, namely, c m _iC m ao, then set 
j = 0. Otherwise, the leaf v q of P will correspond to either the face a^a\c m -\ or 
the face at-iatC m \ in the first case, set j = 1, and in the second case, set j = t). 
For example, in Figure 1, j = 3. Let Li be the maximal biconnected subgraph of 
G that contains the edge (ccj_i , a*), but not the face ai_iCtiC m _i or a,;_-| 
(whichever exists). Let S = /3q, (3\, ■ ■ ■ , be the clockwise order of the neighbors 
of aj- 1 , aj, . . . ,at (in that order) in the subgraphs Lj,Lj + 1 , . . . , L t , where each 
(3k is different from Qy_i . ay, . . . ,a t . For example, in Figure 1, p, = 8. Let e 
be the index such that the dual of the edge ((3 t -\,(3 t ) belongs to P (if no such 
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e exists, then we can do the following: if Lj consists of only one internal face, 
namely, ay_ioy/3o, then set e = 0. Otherwise, the leaf v q of P will correspond 
to either the face /3 0 /3iay_i or the face /3 M _i /3 M ay; in the first case, set e = 1, 
and in the second case, set e = fi). For example, in Figure 1, e = 2. For each 
i, where 1 < * < e — 1 (e + 1 < £ < /z), if there is an edge (/%_!,/?,), then let 
Mi (iVj_ e ) be the maximal biconnected subgraph of G that contains (/3j_i,/3,), 
but not the face Pi-iptak, where k = j — 1 or j. Let (v p -\,v p ) be the edge of 
P that is the dual of the edge (/3 e _i,/? e ). For example, in Figure 1, p = 9. Let 
R be the subgraph of G that corresponds to the subpath v p v p+ \ ...v q oiP. Let 
e (/3 e -i,/3 e ) be an edge on the external face of R. Let Q e be the maximal 
biconnected subgraph of G than contains e but not the face of R containing e. 

D is constructed as shown in Figure 3 (in this figure, we show the construction 
when v is one unit above u. The other two cases, where u is one unit above v, and 
where u and v are in the same horizontal channel are similar. For more details, 
see [6]): w is placed one unit above u, and ci, C 2 , . . . , c m = Cg, c\, c' s are placed 
in the same horizontal channel one unit above w. ao (at) is placed in the same 
vertical channel as c m _i (c m ). ag, au, . . . , ay-i are placed in the same horizontal 
channel. cty_ i, ay-, . . . , at are placed along a line making 45° with the horizontal 
channels, such that each a*, is one unit above and one unit to the left of a^+i- 
/3o,/?i, ■ • ■ ,/?e— l ar e placed one unit above ay_i in the same horizontal channel. 
/? e , /3 £ +i, . . . , P p are placed one unit below at in the same horizontal channel. 
(3 e _i and /3 e are placed in the same vertical channel. A beam drawing E of R is 
constructed. Feasible drawings of H with base uw , and each Ki (1 < i < m — 1), 
K( (1 < i < s) , Li (1 < i < j — 1), Mi (1 < i < e — 1), Ni (1 < i < /j, — e), and 
Q e , with bases Cj_iCj, c' i _ 1 d i , cty-iay, /?i+ e _i/?i+ e , and e, respectively, are 

recursively constructed, with the horizontal distances between the end-points of 
the line-segments uw, Cj_iCj, c(_ 1 c', aPRTxi, Pi-iPi, Pi+e- iA+e, an d e, equal to 
\H\ - 1, \Ki \ - 1, \K[\ - 1, \Li\ - 1, | Mi| - 1, \Ni\ - 1, and \Q e \ - 1, respectively. 
The drawing of each TV,, and Q e , where e is on the bottom boundary of E, is 
flipped upside-down before placing it in D. Also note that /3g and P p are placed 
such that they are either in the same vertical channel as, or to the right of c' s . 
Also note that at is placed 1 + 9 units above the horizontal channel containing 
c' s , where 9 is maximum height of the feasible drawings of K(, N i7 and Q e , where 
e is on the bottom boundary of E. 

Let h(n) and w(n) be the height and width, respectively, of D , as constructed 
by Algorithm OpDraw. Here, n is the number of vertices in G. Let d be the degree 
of G. Note that, by the definition of feasible drawings, w(n) will be equal to one 
plus the horizontal separation between the end-points of B. 

It is easy to prove using induction that w(n) = n is sufficient. As for h(n), 
first notice that, because G has degree d, t — ( j — 1) is less than 2d, and hence, 
the distance between /3 e _i and /3 e is less than 2d + 2. Let h! be a function, such 
that h' (f) = h(n ), where / is the number of internal faces in G , i.e., the number 
of nodes in the dual tree Tq of G. From the construction of D, we have that: 

h'(f) < max{ max {h'(\T Ni \)} max Jh'(\T Qe \)}, 

edge e on bottom boundary of h, 



134 



A. Garg and A. Rusu 



™^{ti(\T K >\)}} + max{ti(\T H \), max {h'(\T Ki \)} . max {h'(\T Li \)}, 

l<i<s 1 l<t<m— 1 1<2<j — 1 

max {h'(\T Mi \)}, max I)}} + 0(d), 

1<2 <€— 1 edge e on top boundary of h, 



The dual trees of H , K i: L,, Mi, and Q e (where edge e is on top boundary of E) 
are either right subtrees of P, or belong to right subtrees of P. The dual trees 
of K[, Ni, and Q e (where edge e is on bottom boundary of E ) are either left 
subtrees, or belong to left subtrees of P. Hence, from Lemma 1, it follows that: 

h 'U ) < tP max {h'(fi) + h'{f 2 ) + 0(d)}. 

Using induction, we can show that h'(f) = O(df 0A8 ) (see also [2]). Since 
/ = O(n), h[n) = h'U) = O(df 0A8 ) = O(dn 0AS ). 

Theorem 1. Let G be an outerplanar graph with degree d and n vertices. G 
admits a planar straight-line grid drawing with area 0(dn 1A8 ) in 0(n) time. 

Proof. Arbitrarily select any edge e = ( u,v ) on the external face of G, and 
designate u and v as the poles of G. Let B be any horizontal line-segment 
with length n — 1, such that the end-points of B are grid points. Construct 
a feasible drawing D of G with base B using Algorithm OpDraw. From the 
discussion given above, it follows immediately that the area of D will be equal 
to n-O(dn 0A8 ) = 0(dn 1A8 ). Using Lemma 1, we can easily implement Algorithm 
OpDraw such that it will run in 0(n) time. 

Corollary 1. Let G be an outerplanar graph with n vertices and degree d, where 
d = o(n 0 ' 52 ). G admits a planar straight-line grid drawing with o(n 2 ) area in 
0(n) time. 
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Abstract. In this paper we introduce a framework for producing cir- 
cular drawings in which the groupings are user-defined. These types of 
drawings can be used in applications for telecommunications, computer 
networks, social network analysis, project management, and more. This 
fast approach produces drawings in which the user-defined groupings are 
highly visible, each group is laid out with a low number of edge crossings, 
and the number of crossings between intra-group and inter-group edges 
is low. 



1 Introduction 

A circular graph drawing is a visualization in which 

— the graph is partitioned into groups, 

— the nodes of each group are placed onto a unique embedding circle, and 

— each edge is drawn with a straight line. 

Circular graph drawing has received increasing attention in the literature [2, 
4,13,14,20,21,22,24,25]. Kar, Madden, and Gilbert presented a circular drawing 
technique for networks in [13]. This approach partitions the graph into groups, 
places the groups onto embedding circles, and then sets the final coordinates of 
the nodes. As discussed in [4] , an advanced version of this technique is included in 
Tom Sawyer Software’s Graph Layout Toolkit (www.tomsawyersoftware.com). 

Tollis and Xia present several linear time algorithms for the visualization of 
survivable telecommunication networks in [25]. 

Citing a need for graph abstraction and reduction of today’s large information 
structures, Brandenburg describes an approach to draw a path (or cycle) of 
cliques in [2], 

InFlow [16] is a tool to visualize social networks. This tool produces diagrams 
and statistical summaries to pinpoint the strengths and weaknesses within an 
organization (www . inflow . net) . 

* On leave from The University of Texas at Dallas. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 135-146, 2004. 
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We presented a linear time algorithm for producing circular drawings of bi- 
connected graphs on a single embedding circle in [20,21]. This technique was 
extended to place a nonbiconnected graph on a single embedding circle in [20, 
22]. A framework for producing circular drawings of nonbiconnected graphs on 
multiple circles was presented in [20,24]. These techniques require 0(m) time 
and produce drawings with a low number of edge crossings. More details about 
these techniques will be dicussed in Section 2. 

Kaufmann and Wiese extended our circular approach [20,21,22,24] in [14]. 
In this approach, the blocktree structures of nonbiconnected graphs are inter- 
preted in a different manner which allows finer structures to be shown within the 
visualization of a biconnected component. The authors also give an interactive 
version of this technique. An advanced version of this approach is included in 
yWorks’ yFiles Library (www.yworks.com). 

All of these techniques are very useful for applications in 
telecommunications[15], computer networks [20], social network analysis 
[16], project management [16], and more. However, with the exception of the 
Graph Layout Toolkit (GLT) technique [4,13], these techniques do not allow 
the user to define which nodes should be grouped together on an embedding 
circle. And in the GLT technique, the layouts of the user defined groups are 
themselves placed on a single embedding circle. For some graph structures, this 
may not be ideal. In this paper, we present a circular drawing algorithm which 
allows the user to define the node groups, draws each group of nodes efficiently 
and effectively, and visualizes the superstructure well. We call this approach 
user-grouped circular drawing. 

An example of an application in which user-grouped circular drawing would 
be useful is a computer network management system in which the user needs 
to know the current state of the network. It would be very helpful to allow the 
user to group the computers by department, floor, usage rates, or other criteria. 
See Figure 1. This graph drawing could also represent a telecommunications 
network, social network, or even the elements of a large software project. There 
are, of course, many other applications which would benefit from user-grouped 
circular drawing. 

The remainder of this paper is organized as follows: in Section 2, we review 
our previous circular techniques. In Section 3, we introduce a framework for user- 
grouped circular drawing. In Section 4, we discuss a force-directed approach in 
which node placement is restricted to the perimeter of circles. In Section 5, we 
present an algorithm for user-grouped circular drawing. In Section 6, we discuss 
conclusions and future work. 

2 Review of Our Previous Circular Techniques 

As mentioned in the previous section, we have presented multiple efficient circu- 
lar graph drawing techniques which produce visualizations with a low number 
of edge crossings [20,21,22,24]. In this section, we give a brief review of these 
techniques. 
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Fig. 1. A user-grouped circular drawing. 



2.1 Circular Drawings of Biconnected Graphs 

In [20,21], we introduced a linear time technique to produce circular graph draw- 
ings of biconnected graphs on a single embedding circle. First, it is important to 
note the difficulty of this problem. Of course, minimizing the number of crossings 
in a drawing is the well-known NP-Complete crossing number problem [10]. The 
more restricted problem of finding a minimum crossing embedding such that all 
the nodes are placed onto the circumference of a circle and all edges are repre- 
sented with straight lines is also NP-Complete as proven in [17]. The authors 
show the NP-Completeness by giving a polynomial time transformation from 
the NP-Complete Modified Optimal Linear Arrangement problem. 

In order to produce circular drawings with fewer crossings than previous 
techniques, we presented the algorithm CIRCULAR which tends to place edges 
toward the outside of the embedding circle. Also, nodes are placed near their 
neighbors. 

This technique visits the nodes in a wave-like fashion, looking for pair edges 
(edges incident to two nodes which share at least one neighbor) which are then 
removed. Sometimes, triangulation edges are added to aid this process. It is 
the selective edge removal which causes many edges to be placed toward the 
periphery of the embedding circle in the visualizations produced by CIRCULAR. 
Subsequent to the edge removal, CIRCULAR proceeds to perform a Depth-First 
Search (DFS) search on the reduced graph. The longest path of the resulting DFS 
tree is placed on the embedding circle and the remaining nodes are nicely merged 
into this ordering. 

The worst-case time requirement of CIRCULAR is 0(m ), where m is the 
number of edges. An important property of this technique is the guarantee that 
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if a zero-crossing drawing exists for a given biconnected graph, CIRCULAR will 
find it. Such graphs must be outerplanar. In fact, CIRCULAR was inspired by 
the algorithm for recognizing outerplanar graphs presented in [18]. For drawings 
which do contain crossings, a postprocessing method which further reduces the 
number of crossings can be applied. See [20,21] for such a method. 

Extensive experiments compared CIRCULAR and Tom Sawyer Software’s 
GLT. CIRCULAR drawings had 15% fewer crossings. This improvement in- 
creased to 30% with the crossing-reduction postprocessing step. Sample drawings 
from the experimental study are shown in Figure 2. 




Fig. 2. The drawing on the left is produced by the GLT. The drawing on the right is of 
the same graph and is produced by CIRCULAR with crossing-reduction postprocessing. 
The drawing on the right has 75% fewer crossings than the GLT drawing. 



2.2 Circular Drawings of Nonbiconnected Graphs on a Single 
Embedding Circle 

In [20,22] we presented the algorithm CIRCULAR-Nonbiconnected for produc- 
ing circular drawings of nonbiconnected graphs on a single embedding circle. 
Given a nonbiconnected graph G, we can decompose G into biconnected compo- 
nents. In CIRCULAR-Nonbiconnected, we layout the resulting block-cutpoint 
tree on a circle and then layout each biconnected component with a variant of 
CIRCULAR. 

First, we consider how to attain a circular drawing of a tree. A DFS pro- 
duces a numbering that we can use to order the nodes around the embedding 
circle in a crossing-free manner. From this result, we know how to order the bi- 
connected components around the embedding circle. Next, we need to consider 
articulation points which are not adjacent to a bridge ( strict articulation points). 
Strict articulation points appear in multiple biconnected components. In which 
biconnected component should a strict articulation point appear in the circular 
drawing? Multiple approaches to this issue are discussed in [20,22]. Due to space 
restrictions, we do not discuss these solutions here. A third issue to consider is 
how to transform the layout of each biconnected component to fit onto an arc 
of the embedding circle. This transformation is called breaking. The resulting 
breaks occur at an articulation point within the biconnected component. 
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The worst-case time requirement for CIRCULAR-Nonbiconnected is 0(m ) if 
we use CIRCULAR to layout each biconnected component. The resulting draw- 
ings have the property that the nodes of each biconnected component (with the 
exception of some strict articulation points) appear consecutively. Furthermore, 
the order of the biconnected components on the embedding circle are placed 
according to a layout of the accompanying block-cutpoint tree. Therefore, the 
biconnectivity structure of a graph is displayed even though all of the nodes 
appear on a single circle. An example drawing is shown in Figure 3. 




Fig. 3. An example drawing produced by CIRCULAR - Nonbiconnected. 



2.3 Circular Drawings of Nonbiconnected Graphs on Multiple 
Embedding Circles 

In [20,24] we presented Algorithm CIRCULAR-withRadial which produces cir- 
cular drawings of nonbiconnected graphs on multiple embedding circles. As in 
CIRCULAR-Nonbiconnected, we first decompose the given graph G into bicon- 
nected components. Then we layout the block-cutpoint tree with a variant of 
the radial layout technique [1,5,9]. Then each biconnected component is laid out 
with a variant of CIRCULAR. Many details are omitted here. The worst-case 
time requirement is 0(m). 

CIRCULAR-withRadial is a very useful technique and extension of this work 
to include interactive schemes has been presented by Kaufmann and Wiese in 
[14]. CIRCULAR-withRadial groups given graphs by their biconnected compo- 
nents. This grouping helps to provide a logical, beneficial view of nonbiconnected 
graphs, however in some cases it would be helpful to allow the user to define 
which nodes should be placed together on an embedding circle. We present such 
an algorithm in Section 5. 

3 A Framework for User-Grouped Circular Drawing 

The problem of producing circular drawings of graphs grouped by biconnectivity 
is quite different from the problem of drawing a graph whose grouping is user- 
defined. In the latter case, there is no known structure of either the groups or 
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the relationship between the groups. Therefore, we must use a general method 
for producing this type of visualization. The four goals of a user-grouped circular 
drawing technique should be: 

1. the user-defined groupings are highly visible, 

2. each group is laid out with a low number of edge crossings, 

3. the number of crossings between intra-group and inter-group edges is low, 
and 

4. the layout technique is fast. 

We know from previous work in clustered graph drawing [6,7,8,12] that the 
relationship between groups is often not very complex. We take advantage of 
this expectation in this framework. Define the superstructure G s of a given graph 
G = (V,E,P), where P is the node group partition, as follows: the nodes in G s 
represent the elements of P. For each edge e £ E which is incident to nodes in two 
different node groups, place an edge between nodes representing the respective 
groups in G s . The type of structure which we expect G s to have should be 
visualized well with a force-directed [3,5] technique, therefore we will layout 
the superstructure G s with this approach. Since G s will likely not be a very 
complicated graph, it should not take much time to achieve a good drawing 
with a force-directed technique. 

The node groups themselves will be either biconnected or not and since CIR- 
CULAR and CIRCULAR-Nonbiconnected can layout biconnected and nonbi- 
connected graphs on a single embedding circle in linear time and have been 
shown to perform well in practice, we also will use those techniques here. 

We have now addressed how to achieve goals 1 and 2 with good speed. How- 
ever, in order to produce good user-grouped circular graph drawings, we must 
successfully merge these two techniques so that we can simultaneously reach 
goals 1,2, and 3. And, of course, we need a fast technique in order to achieve 
goal 4. Attaining goal 3 is very important to the quality of drawings produced 
by a user-grouped circular drawing technique. As shown in [19], a drawing with 
fewer crossings is more readable. It is especially important to reduce the num- 
ber of intra-group and inter-group edge crossings as those can particularly cause 
confusion while interpreting a drawing. See Figure 4. How can we achieve this 
low number of crossings? We must place nodes which are adjacent to nodes in 
other groups (called outnodes in [4,13]) close to the placement of those other 
nodes. A force-directed approach is a good way to attain this goal since it would 
encourage outnodes to be closer to their neighbors. Traditional force-directed 
approaches [3,5] will not work here though, because we need to constrain the 
placement of nodes to circles. In Section 4, we present a force-directed approach 
in which the nodes are restricted to appear on circular tracks. With the use of 
this technique we will reach goal 3. As will be discussed, we can do this in a 
reasonable amount of time. 

As with most force-directed techniques, the initial placement of nodes has a 
very significant impact on the final drawing [3,5]. Therefore it is important to 
have a good initial placement. This is why we should layout the superstructure 
and each node group first. At the completion of those steps, we should have the 
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Fig. 4. Example of intra-group and inter-group edge crossings. 



almost-final drawing. It will then be a matter of fine-tuning the drawing with 
our new circular-track force-directed technique. And as shown in [23], once you 
have an almost-final drawing, it does not take much time for a force-directed 
technique to converge. 



4 Circular-Track Force-Directed 

In order to adapt the force-directed paradigm for circular drawing, we need a 
way to guarantee that the nodes of a group appear on the circumference of 
an embedding circle, the circular track. The nodes are restricted to appear on 
the circular track, but are allowed to jump over each other and appear in any 
order. See Figure 5. And as in the force-directed approach, we want to minimize 
the potential energy in the spring system which is modelling the graph. In this 
section, we describe how this circular-track adaptation can be achieved. 




Fig. 5. Circular-track force-directed technique. 



First, we need to look at node coordinates in a different way. Node i belongs to 
group a and is located at position (xj, yt). Given that the center of the embedding 
circle on which a is located is at ( x a ,y a ) and the radius of that circle is r a , we 
can restate the coordinates of i in the following way: 



Xi = x a + r a * cos{9i) 



(1) 
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Vi = y a + r a * sin(9i ) (2) 

Remember that Hooke’s Law [11] gives us the following equation for the 
potential energy V in a spring system: 

V = ^ ] kij \{xi — Xj) + ( yi — Vj) ] (3) 

ij 

where kij is the spring constant for the spring between nodes i and j. Equa- 
tion (3) can be rewritten using (1) and (2): 

V = J2(i,j)eE k ij [((*« + r » * cos(Oi)) - ( xp + rp * cos(9 J ))) 2 + 

(( y a +r a * sin(Oi)) - ( yp +rp* sin(9j))) 2 ] (4) 

where node j belongs to group /3, (xp,yp) is the center and rp is the radius 
of the embedding circle on which /3 appears. Following through on the minus 
signs, we rewrite: 

V = J2(i,j)GE k ij [( X « + r a * COs(9i) ~Xp~Tp* COs(9j)) 2 + 

(z la + r a * sin(9i) -yp-rp* sin(9 0 )) 2 ] (5) 

We can find a minimal energy solution on variables x, y and 9. It is interesting 
to note that if i and j are on the same circle, then x a and xp are equivalent as 
are y a and yp. And, of course, r a = rp. Now we rewrite equation (5): 

V= ^2 k ij[ r a{cos(9i) — cos(9j)) 2 + r a {sin(9i) — sin(9j)) 2 )} (6) 

We can calculate r a from the number of nodes in a, so that means that 
finding the minimum V is now a one-dimensional problem based on finding the 
right set of 9s. When we combine (5) or (6) with equations for magnetic repulsion 
to prevent node occlusion, we have a force-directed equation for which the nodes 
of a group lie on the circumference of a circle. Now we extend equation (5) to 
include repulsive forces. 

Pij = [{x a + r a * cos(9i) - xp-rp* cos(9j)) 2 + 

(y a + r a * sin(9i) -yp-rp* sin(9j )) 2 ] (7) 

V= kij (®) 
(i,j)evxv 

where gij is the repulsive constant between nodes i and j. The force on node 
i by node j is: 



Fi = 



V ( 9i + e, 9j) — V(9i — e, 9j) 
2e 



( 9 ) 



where e is a very small constant. 
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Another important consideration is the set of spring constants used in the 
above equations. It is not necessary for the spring constant to be the same 
for each pair of nodes. It is also possible for these constants to change during 
different phases of execution. 

5 A Technique for Creating User-Grouped Circular 
Drawings 

Now that we have a force-directed technique in which the nodes are placed on 
circular tracks, we need to show how we will successfully merge the force-directed 
approach and circular drawing techniques of [20,21,22,24]. We now present a 
technique for creating user-grouped circular drawings. 

Algorithm 1 CIRCULARwithFORCES 

Input: A graph G = (V, E, P). 

Output: User-grouped circular drawing of G r. 

1. Determine the superstructure G a of G. 

2. Layout G s with a basic force-directed technique. 

3. For each group pi in P 

a) If the subgraph induced by p,;, G,, is biconnected 
layout Gi with CIRCULAR. 

b) Else layout Gi with CIRCULAR-Nonbiconnected. 

4. Place the layout of each group pi at the respective location found in Step 2. 

5. For each group Pi 

a) rotate the layout circle and keep the position which has the lowest local 
potential energy. 

b) reverse the order of the nodes around the embedding circle and repeat 
Step 5a. 

c) if the result of Step 5a had a lower local potential energy than that of 
Step 5b revert to the result of Step 5a. 

6. Apply a force-directed technique using the equations of Section 4 to G. 

Going back to the four goals discussed in Section 3, we will attain goal 1 
by using a basic force-directed technique to layout the superstructure. We will 
attain goal 2 by laying out each group with either CIRCULAR or CIRCULAR- 
Nonbiconnected. Attaining goal 3 means successfully merging the results of the 
force-directed and circular techniques. 

Once we have the layout of the superstructure and each group, we place the 
layout of each group at the respective location found during the layout of the 
superstucture. Now we have an almost-final layout: it is a matter of rotating the 
layouts of the groups and maybe adjusting the order of nodes around the embed- 
ding circle. Since we know that CIRCULAR and CIRCULAR-Nonbiconnected 
produce good visualizations, we should change these layouts as little as possi- 
ble. So first, we will fine-tune the almost-final drawing by rotating each layout 
and keeping the rotation which has the least local potential energy. We rotate 
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each embedding circle through n a positions, where n a is the number of nodes 
in the group a. With respect to determing local potential energy, we need to 
determine the lengths of inter-group edges which are incident to the nodes of 
a. The rotation of choice should minimize the lengths of those edges. In other 
words, we choose the rotation in which as many nodes as possible are close to 
their other-group neighbors. Since for each embedding circle we try n a positions 
and examine the length of a’s incident inter-group edges at each position, then 
the rotation step will take 0(n * winter- group) time for the entire graph, where 
minter-group is the number of inter-group edges. As discussed in Section 3, we 
expect Winter- group ■C w. Then we will “flip” each layout and again rotate. We 
keep the rotation which has the least local potential energy. After these steps, 
it is still possible that some nodes will be badly placed with respect to their re- 
lationships with nodes in other groups. In other words, those placements cause 
intra-group and inter-group edges to cross. In order to address this problem, we 
will apply the force-directed technique described in Section 4. The result of this 
step will be the reduction of intra-group and inter-group edge crossings since 
nodes will be pulled to the side of the embedding circle which is closer to their 
other-group relatives. 

Because Algorithm 1 makes use of a force-directed technique, the worst- 
case time requirement is unknown. However, in practice, we expect the time 
requirement to be 0{n 2 ) for the following reasons: Step 1 requires O(m) time. 
Step 2 will be on a small graph and should not require much time to reach 
convergence. Step 3 requires 0(w) time. Step 4 requires 0(n) time. Step 5 
require 0(n * Winter-group) time. Since Step 6 is a force-directed technique it 
could take 0(n 3 ) time in practice, however the result of the previous steps will 
be an almost-final layout and thus should not need much time to converge. 
It was evidenced in [23] that when a force-directed technique is applied to an 
almost-final layout, it does not take much more time for convergence to occur. 
Therefore, in practice we expect this step to require 0(n 2 ) time. Thus, we have 
attained goal 4 from Section 3. 




Fig. 6. Sample user-grouped circular drawing from our preliminary implementation. 



We have done a preliminary implementation of CIRCULARwithFORCES in 
TCL/TK. In this implementation, all nodes and embedding circles are given an 
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arbitrary initial placement. Then the force-directed equations of Section 4 are 
applied to the graph with the placement of group embedding circles frozen. See 
Figure 6 for a sample drawing. 

An interesting behavior we noticed is that the drawing with minimal energy 
is not necessarily the best circular drawing. In circular drawing, a major goal is 
to reduce edge crossings. However, it is well known [3] that reducing crossings 
sometimes means the compromise of other aesthetics, especially area. And area 
is related to minimum energy in spring systems. We propose adding springs from 
each node to its initial placement on the plane with the spring constants for these 
springs being high. This should keep these nodes from gravitating towards each 
other too much and causing extra crossings. We also suggest creating dummy 
nodes which are placed in the center of each embedding circle and attaching 
strong springs from them to every node in their respective group. 



6 Conclusions and Future Work 

In this paper, we have presented a framework for creating circular graph draw- 
ings in which the grouping is defined by the user. This framework includes the 
successful merging of the force-directed and circular graph drawing paradigms. 
We introduced the algorithm CIRCULARwithFORCES that produces drawings 
in which the user-defined groupings are highly visible, each group is laid out with 
a low number of edge crossings, and the number of crossings between intra-group 
and inter- group edges is low. This layout technique is also fast. 

In the future we would like to extend this approach to include layouts in 
which nodes are placed on the periphery of shapes other than circles or on 
curves. We would also like to develop a technique which uses this approach in 
three dimensions. 
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Abstract. In this paper we consider the problem of drawing a planar 
graph using circular-arcs as edges, given a one-to-one mapping between 
the vertices of the graph and a set of n points on the plane, where n is 
the number of vertices in the graph. If for every edge we have only two 
possible circular arcs, then a simple reduction to 2SAT yields an 0(n 2 ) 
algorithm to find out if a drawing with no crossings can be realized. We 
present an improved 0(n 7 ' A polylog n ) time algorithm. For the special 
case where the possible circular arcs for each edge are of the same length, 
we present an even more efficient algorithm that runs in 0(n 3 ^ 2 polylog n) 
time. We also consider the problem if we have more than two possible 
circular arcs per edge and show that the problem becomes NP-Hard. 
Moreover, we show that two optimization versions of the problem are 
also NP-Hard. 



1 Introduction 

A natural question that arises in graph drawing is whether a graph with fixed 
vertices can be drawn without crossings, when several choices are given for each 
of the edges. From an information visualization point of view convex edges are 
preferable, i.e., straight line segments or circular arcs. In general, embedding a 
planar graph at fixed locations and drawing it with straight lines may result in 
many crossings. Using circular arcs instead can reduce or eliminate the crossings; 
see Fig. 1(a). Thus, a natural problem to consider is whether a given graph with 
fixed vertex locations can be drawn without crossings, using circular arcs. 

We first consider the 2- Circle Drawing (2 CD) problem, in which each edge has 
to be drawn as one of the two circular arcs defined by a circle passing through 
the endpoints. This problem is reminiscent of the Manhattan wiring problem: 
Consider the axis-aligned rectangle with a diagonal defined by the line segment 
between two vertices connected by an edge. Then the two semi-rectangles sepa- 
rated by the diagonal are the two choices for drawing the edge. This formulation 
of the problem can be efficiently solved in 0(n log n) time, using an efficient find 
and delete data structure (to find intersections between a pair of semi-rectangles 
and to delete a semi-rectangle from the data structure) [14]. 
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Fig. 1 . (a) 12(n 2 ) crossings with straight-line edges and none with half-circles; (b) A 
planar graph that cannot be drawn without crossings using any circular arc segments. 

(c) Given the circle Ci, edge a = (u,v) is drawn either with the circular arc d or cf. 

(d) An example of a 2SAT reduction: (cTV C3) A (cTVcl) A (03 VC2) A (C1VC2) A (ci VC2). 



The same approach cannot be applied to the 2CD problem directly. Although 
efficient data structures exist for operations on full circles, no such data struc- 
tures exist in the case of circular arcs (semi-circles). The novelty of our 2CD 
algorithm is that we provide a way to use an efficient data structure for full 
circles to solve the problem with circular arcs. For the sake of completeness, 
we first show that the 2CD problem can be reduced to 2SAT and thus solved 
in 0(n 2 ) time. Then we provide our novel approach to solve the 2CD problem 
in time 0(n 7 ^polylog n) for the general case and in time 0(n 3 ^ 2 polylog n) for 
the restricted case where the circular arcs are exactly half-circles. Although the 
practical gain in terms of running time is not significant, we believe that our 
approach for solving the 2CD problem might be of independent interest to solve 
similar problems. 

Next, we consider the 3-Circle Drawing (3CD) problem, where for each edge 
there are three circular arcs to choose from. We show that the 3CD problem is 
NP-hard. Although using circular arcs to represent the edges allows a certain 
flexibility, not every planar graph can be drawn without crossings using circu- 
lar arcs. Fig. 1(b) shows an example of a planar graph that cannot be drawn 
without crossings using any circular arc segments. This difficulty suggests two 
optimization problems: Min2CD is the problem of minimizing the number of 
crossings for a given 2CD instance by representing every edge with an appro- 
priate circular arc. Max2CD is the problem of maximizing the number of edges 
that can be drawn without crossings using circular arcs. We show that both of 
these optimization problems are NP-hard. 

2 Previous Work 

Several variations of the problem of embedding a planar graph at fixed point 
locations have been studied. If we can choose the mapping between the vertices 
V and the points P, then Kaufmann and Wiese [15] show that the graph can be 
drawn without crossings using 2 bends per edge in polynomial time. However, if 
the mapping between V and P is given, Pach and Wenger [19] show that 0(n ) 
bends per edge are necessary to guarantee planarity, where n is the number of 
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vertices in the graph. Godau [12] shows that if each vertex is allowed to move 
slightly in the neighborhood of a fixed point then the problem becomes NP-hard. 

Drawing graphs with circular arcs with no assigned vertex locations has been 
considered by Cheng et al [5] in the context of planarity, angular resolution and 
drawing area. The problems under consideration in this paper are also related 
to the k-position point labeling problem , extensively studied in map labeling [3, 
20,21]. In the k-position point labeling problem we are given a set of points 
and a set of k possible label positions for each point and we would like to find a 
labeling of the points that optimizes specific criteria. Criteria such as maximizing 
the number of labeled points [3,23] and maximizing the size of the labels [6,7, 
11] have been considered. A variant of the map labeling problem is reduced 
to 2SAT and NP-Completeness results are presented in [11]. Although these 
problems are related to drawing planar graphs with circular arcs, there is a 
significant difference: whereas map labeling problems are restricted by region 
intersections, drawing planar graphs with circular arcs is restricted by circular 
arc intersections. 



3 Circular Arcs Drawing: 2CD 

The input to the problem is a planar graph G = (V, E), a point set P, and a 
one-to-one function / : V — > P such that \V\ = \P\ = n and \E\ = m. Note that 
m = 0(n) as G is a planar graph. For any edge = (u, v) € E, we are given 
some circle C, that passes through u and v. The two vertices, u and v, determine 
two circular arcs on Cp. let Ci and cf be their labels, see Fig 1(c). We would like 
to find out whether G can be drawn without crossings using Ci or cf for each e*, 
and if so to provide such a drawing of G. 

We first suggest a straightforward solution of the problem using a reduction 
from 2CD to 2SAT. The reduction to a 2SAT formula requires that we identify 
all intersections between circular arcs. For each such intersection Ci tlCj ^ 0, we 
add the formula (cf Vq) to see Fig 1(d). Since there are 0(n 2 ) crossings, the 
reduction takes 0[n 2 ) time and results in a 2SAT formula $ with 0(n ) variables 
and 0(n 2 ) clauses. It is easy to see that G can be drawn without intersections if 
and only if the corresponding formula is satisfiable. Since 2SAT can be solved 
in time linear in the number of clauses and variables [4,10], we have an 0(n 2 ) 
time algorithm for the 2CD problem. However we can do much better. 



3.1 The 2CD Algorithm 

Note that for a given edge e* = (u, v ) £ E, as possible drawings of e, we consider 
the circular arcs defined by the circle Ci , and the position of u and v on C{. 
Alternatively, we could consider the axis-aligned rectangle having its diagonal 
as the line segment (u, v ) , and then consider the 2 semi-rectangles separated 
by the diagonal as two choices for drawings of the edge e*. This formulation of 
the problem is known as the Manhattan wiring problem which can be efficiently 
solved in O(nlogn) time [14]. 
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Algorithm 2CD 

while T) not empty 
let Ci be a circle in T) 
delete(V, Ci) 
delete ci from V 
traversejpossible(ci) 
start with initial data structure T> 
while 3 Ci, Cj £ P s.t. d fl Cj ^ 0 
let dfsnwmber(a) < dfsnumber(cj) 
delete(V, Ci) 
delete Ci and ci from V 
add ci into C 
traverse-certain(ci) 
if 3 intersecting half-circles in C 
output No 
else output C U V 



Traverse_possible(ci) 

while Cj=find(T>,Ci) not empty 
delete(V, Cj) 

let Cj be involved in the intersection 
delete Cj from V 
traversejpossible(cj) 

Traverse_certain(ci) 

while Cj=find(T>,a) not empty 
deletefD, Cj) 

let Cj be involved in the intersection 
delete Cj and cj from V 
add cj into C 
traverse-certain(cj) 



Fig. 2. Algorithm 2CD. The input to the algorithm is T>, the data structure used to 
store all the circles. If it is possible to draw the graph without crossings, the algorithm 
outputs the set of circular arcs used to draw the graph. 



We describe a new algorithm that solves a more general problem for circular 
arcs. Our approach is different from the Manhattan wiring problem in that we 
perform operations only on complete circles. More formally, we find/delete com- 
plete circles as part of an intersection, as opposed to performing the operations 
on the circular arcs directly. Let V denote the data structure used to store all the 
circles. Let find(T>,Cj ) be a function that finds a circle Cj intersecting circular 
arc Ci, and delete(T>,Ci) be a function that deletes the circle C, from V. Let 
a(n),/3(n) denote the time required to perform the find and delete operations, 
respectively. We next describe how to construct the data structure T> and how 
to perform the find/delete operations efficiently. The main algorithm is shown 
on Fig. 2. 

Let a possible circular arc be one that is not yet chosen and not yet discarded 
and let V denote the set of possible circular arcs. V initially contains all the 
circular arcs. We traverse the circles in a depth-first manner starting with an 
arbitrary circular arc Cj and making all possible assignments. Each circle is found 
and deleted exactly once, resulting in a sequence of 0{n) find/delete operations 
and requires 0{n x ( a(n ) + /3 (n)) time. At the end of the first while loop in 
the main algorithm, the set V contains exactly one circular arc for each edge. 
However, V might contain intersections. At this point we make the following 
observation: 

Observation 1. Let d,Cj £ V. If Cj IT Cj ^ 0 and dfsnumber(ci) < 
dfsnumber(cj) , then the implication Cj => Cj holds, and ci must be included in 
the final solution. 
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Let C denote the set of certain circular arcs. Initially C is empty. Once we find a 
circular arc, c^, that is certainly in the final solution by the above observation, 
we perform a traversal from Ck, placing all certain arcs in the set C. To find 
the intersections in V it suffices to perform a plane sweep over the whole set V. 
Since, whenever we encounter an intersection in V we delete the whole circle, 
the plane sweep over V finds 0(n ) intersections in total. Then the second while 
loop requires 0(nlogn + n x (a(n) + f3{n))) time. Finally we end up with a set 
C of certain circular arcs, and a set V of possible circular arcs. At this point we 
make a second observation: 

Observation 2. Let Ci,Cj £ CUV. If Cj fl Cj ^ 0, then Ci,Cj £ C. 

The observation holds for the following reasons: Assume Ci,Cj £ V were true. 
Then we must have encountered the intersection in the plane sweep step in which 
case one of them would have been deleted from V. So, Ci, Cj £ V can not be true. 
On the other hand assume, Cj £ C and Cj £ V were true. Then we must have 
traversed through Cj before visiting Ci in the traversal step. But when we traverse 
through a circle, we delete the whole circle from V. So this can not be the case 
either. 

Then we need to concentrate on the intersections in C. We perform a final 
plane sweep over the set C. If we encounter an intersection, then there cannot be 
an assignment without intersections, otherwise C U V gives us a feasible assign- 
ment. The running time of the algorithm is the time required for the two while 
loops in the main algorithm: 0(n log n + n x (a(n) + /3(n))). In the next section 
we describe the data structure that supports the needed operations. 

3.2 The Data Structure 

Given a circular arc query Ci, finding and deleting a circle Cj that intersects 
Ci is more efficient than performing the same operations on a circular arc Cj 
that intersects Cj. This observation led us to the 2CD algorithm which assumes 
the existence of a data structure T> that stores all the circles and allows for 
efficient find/delete operations. Gupta et al [13] show how to reduce the problem 
of querying circles with a circular arc to one of half-space range searching in 
higher dimensions. The method requires at most a 4-dimensional half-space range 
searching. To report such intersections then, we make use of the ideas from 
geometric range- searching [1,2,18]. The main data structure we use is a partition 
tree, constructed using the partitioning theorem by Matousek [17]: a point set P 
can be partitioned into 0{n 1 ~ l t d ) classes in time 0(n log n 1_1 / d ) such that for 
any class Pi, |Pi| < 2 xn 1 ^ and any line l intersects at most 0(nf 1 ~ 1 / d ' > ) classes, 
where d is the dimension of the search space. Using this partitioning theorem 
we can create a data structure T>' that performs half-space range queries in time 
0(n 1_1 / d (logn)°^ 1 ' > ). Moreover V is dynamic, in the sense that we can delete 
a circle from T>' in amortized time O(logn). Then using multiple levels of T>' 
to satisfy the intersection conditions of [13], we create the data structure V 
that supports findlfD , Ci) operations in 0(n 3 ^ 4 polylog n) time and that requires 
0(n log n) time for a sequence of O(n) delete(T>,Ci ) operations. These results 
can be summarized with the following theorem for the 2CD problem. 
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Fig. 3. (a) Dj contains an endpoint of a (b) center (Cj) £ m and center(Cj) £ Dzj ■ 



Theorem 1. The 2CD problem can be solved in time 0(n 7 ^polylog n). 



3.3 Allequal2CD 

We can solve a restricted version of the 2CD problem even more efficiently. Let 
Allequal2CD be the version of 2CD where each edge e, has the same length and 
the line segment between the endpoints of e, is the diameter of circle Cj. In this 
case, for a given edge e i: the circular arcs Cj and Cj are half-circles. We present an 
algorithm to solve Allequal2CD in 0(n 3 ^ 2 polylog n) time using a data structure 
T> that enables us to perform efficient find/delete operations. We provide the 
details for the construction of T> here, since the general data structure described 
above can be constructed in a similar fashion. 

Let center(Ci ) and separator(Cf) denote, respectively, the center of Cj and 
the line separating the half-circles Cj and Cj. Define 7Tj as the half-plane bounded 
by separator(Cj), and that contains Cj. Let Dj be the disk bounded by the circle 
Cj, and let D 2 i be the disk concentric to Dj but with radius twice the radius of 
Dj, see Fig 3. The following lemma is easy to verify. 

Lemma 1. A circle Cj intersects a half-circle Ci if and only if (i) Dj contains 
an endpoint of Ci, or (ii) center(Cj) £ 7Tj and center(Ci) £ D 2 j. 

In order to report intersections of the first type we use the data structure 
described by Efrat et al in [9]: given n equal-sized disks in the plane, construct 
a data structure D7i in time 0(n log n) such that for a given query point p, 
finding a disk that contains p requires 0(log n) time. Moreover, deleting a disk 
from D7i requires amortized time O(logn). We preprocess the disks Dj for each 
i using this structure. 

To deal with the intersections of the second type we make use of the partition 
tree V described above. However, this time we perform half-space range search- 
ing in 2-dimensions using a two-level data structure. Let the data structure for 
the second type of intersections be D72- The first level of VT 2 is a partition 
tree, 'DT 2 ■ Based on the partitioning theorem described above, we partition the 
centers of all the circles and recursively build the partition tree VT 2 ■ The leaves 
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of VT 2 partition the centers into constant-sized subsets. Each internal node v is 
associated with a subset P v of the points contained in the leaves of the subtree 
rooted at v. We build the second level of the data structure based on these sub- 
sets. The second level data structure used in VT 2 is the same as X>71 , except we 
preprocess the disks Z? 2 i for each i, rather than the disks Di as is the case for 
constructing T>T \ . We call this second level data structure PT 2 ' to distinguish it 
from T>T\ which we used to find the first type of intersections. Each internal node 
v in PT 2 contains a pointer to the corresponding T> T 2 , where T> T 2 contains 
the data structure for all the disks D 2 i centered at P v . The preprocessing time 
for constructing the partition in a node of T> T 2 with m points is 0(m log to). 
Constructing VT 2 " for the same node also takes time O (m log m) . Since the 
number of points in nodes of PT 2 decreases as a double exponential with their 
depth in the tree, the total preprocessing time is 0(n log n). 

Theorem 2. Allequal2CD problem can be solved in time 0(n 3 ^ 2 poly log n). 

To find a circle Cj intersecting a given a half-circle Ci we first query T > 71 with Ci s 
endpoints. This step requires O(logn) time. If we cannot find such a circle then 
we query PP 2 with separator(Ci). Upon finding an internal node v such that 
P v lies completely above separator(C)), we query the associated VT 2 " of v with 
center(Ci). Let a(n) be the time to find a circle intersecting a given half-circle 
Cj. Then a(n) is bounded by the query time of PP 2 and we get: 

a(n) < 0{y/n) x log 2 y/n + 0(tfn) x a(2 y/n) (1) 

Thus the time required to perform a find operation is a(n) = 0(y/npolylog n). 

In order to delete a circle Ci , we first delete Di from T>T\ in O(logn) amor- 
tized time. We also need to delete the appropriate disks in T>72- To do this we 
simply find each internal node v of such that center(Cj) £ P v , and delete 
the corresponding disk from PT 2 ", the second level data structure pointed to 
by v. Since VT 2 has depth O(loglogn) and deleting a disk from VT 2 " takes 
amortized time O(logn), the deletion of a circle takes 0( log 2 n) amortized time. 
Since find and delete operations are defined for both 2?7i and PP 2 , the two data 
structures form the complete data structure T> and the theorem follows. 

4 The 3CD Problem 

The 3CD problem is similar to 2CD, except now we have three choices for the 
drawing of each edge e* = (u,v). We consider the problem in which in ad- 
dition to the two half-circles we can also choose the line segment connecting 
u and v. We show that 3CD is NP-hard using a reduction from the NP-hard 
PLANAR-3SAT [16]. A 3SAT instance $ is called a PLANAR-3SAT instance 
if the (bipartite) occurrence graph G& = (V&,E$) is planar. In the occurrence 
graph, V$ contains a vertex for each variable and clause, and E$ contains an 
edge between two vertices v,w € V<p if v represents a variable x that occurs 
in the clause represented by w. Let VR3S AT (Variable Restricted 3SAT) be the 
version of 3SAT with the restriction that each variable can appear at most three 
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(a) (b) (c) 

Fig. 4. (a) Variable-circle of xq (b) Clause-circle of (xi V j/j V z/t); (c) Circle drawing of 
(xi V y V 2 ) A (aTfVa^) . . .. Each edge is represented by 2-link-chains (two parallel chains 
of small circles). Since the graph is planar there will be no crossings of the chains. 



times, and VR1IN3SAT be the version of VR3SAT in which exactly one literal in 
each clause is required to be true. In the planar versions of these two problems, 
the occurrence graphs of the input instances must be planar. We will convert 
a PLANAR-3SAT instance <L> into a 3CD instance ^3 cd through a series of 
modifications that preserve planarity. 

Lemma 2. PLANAR- VR3SAT is NP-hard. 

Proof Sketch: Due to space constraints we leave the proof of this lemma to the 

full version of the paper [8] . □ 

Lemma 3. PLANAR- VR1IN3SAT is NP-hard. 

Proof Sketch: Due to space constraints we leave the proof of this lemma to the 

full version of the paper [8] . □ 

Theorem 3. The 3CD problem is NP-hard. 

Proof Sketch: We convert a PLANAR- VR1IN3SAT instance L>" into a 3CD 

instance T> 3 cd- Because of the VR reduction, the occurrence graph G$" for <L>" 
has maximum degree 3. Then there exists an orthogonal drawing for G$" (a 
drawing such that each vertex is on the integer grid and each edge consists of 
horizontal and vertical edge segments) and the grid is of size quadratic in the 
size of G4," [22]. Given the orthogonal drawing of G$", we obtain <P 3 cd by the 
following method: We replace each vertex corresponding to a variable Xi with 
a variable- circle, with one half labeled Xi and the other xf, see Fig. 4(a). We 
replace each vertex corresponding to a clause, say, (xi V y.j V Zk) of T>" , with a 
clause-circle having one half-circle labeled Xj, one labeled yj, and the diameter 
of the circle labeled Zk, see Fig. 4(b). We represent each edge of G$" with a 
2-link-chain which consists of two parallel links of chain-circles. Let be the 
edge of G$" between the vertex corresponding to the clause (xi V yj V Zk) and 
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the vertex corresponding to the variable Xj. Then we represent by a 2-link- 
chain, where one of the links is connected to the half-circle of the clause-circle 
labeled with Xi on one end, and to the half-circle x,; of the variable-circle on the 
other. The other link intersects with both the half-circle yj and the diameter 
Zk of the clause-circle on one end, and is connected to the half-circle Xi of the 
variable-circle on the other, see Fig. 4(c). 

We claim that <!>" is satisfiable if and only if ^ 3 cd has a feasible assign- 
ment without crossings. Assume that <&' is a satisfiable instance of PLANAR- 
VR1IN3SAT, and let a be a satisfying assignment. A feasible assignment of 
edges in ^ 3 cd is as follows: For each variable-circle corresponding to variable X{, 
assign the half-circle labeled with Xi or xi depending on whether Xj is assigned 
to true or false in a respectively. For each clause-circle corresponding to a clause 
(Xj V yj V Zk), assign the half-circle (or diameter) corresponding to the (only) 
true literal in the clause, as determined by a. For each 2-link-chain connected 
to the variable-circle of Xj, if Xi is assigned to true in a , then for the link that 
is connected to Xj , assign the first chain-circle by choosing the half-circle that 
does not cross the Xj, and continue assigning the chain-circles through the link 
without creating any crossings. For the 2 nd link that is connected to the xi half- 
circle, assign the first chain-circle by choosing the half-circle that crosses the 
half-circle xi, and continue assigning the chain-circles through the link without 
creating any crossings. This assignment does not contain any crossings. The only 
crossings that could occur would be between a chain-circle at the tip of a link 
and a clause-circle, but our method of assigning the chain-circles eliminates this 
possibility. 

For the other direction, assume that @3 cd has a feasible assignment of edges 
without crossings. Then, finding a truth assignment a for <P” is straightforward: 
For each variable-circle corresponding to a variable x,, if the half-circle labeled 
with Xi is chosen, then assign x,; to be true, otherwise assign it false. This yields 
a satisfying assignment, since the feasible assignment of edges in $3 cd chooses 
exactly one edge from each clause-circle such that there are no conflicts with the 
variable assignments and the true literal assignment for the other clauses. □ 

5 Drawing with Few Crossings 

If G cannot be drawn without crossings using half-circles, there are two natural 
optimization problems. Define Min2CD as the following decision problem: Given 
( G = (V,E),kmin), where G is a planar graph, and kmin is a non-negative 
integer, does there exist an assignment of half-circles (either Cj or cf) for each e* 6 
E such that the number of crossings is at most kmin ? The second optimization 
problem, Max2CD, is defined as follows: Given (G = ( V,E),kmax ), where G 
is a planar graph, and Umax is a non-negative integer, does there exist an 
assignment of half-circles (either Cj or ci) for some e* £ E such that there are 
no crossings and the number of assigned edges is at least Umax 1 - We prove that 
both problems are NP-hard by reductions from the Planar Degree-3 Independent 
Set problem (PD3IS). Let H = ( Vh,Eh ) be an undirected graph. We say that 
a set I C Vh is independent if for all pairs where i,j £ I, ( i,j ) £ Eh- 
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(a) 







(b) 



Fig. 5. (a) Min2CD reduction. Vertex circles correspond to vertices in H, chain circles 
correspond to edges in H, and tail circles are auxiliary circles, (b) The original graph 
H drawn on an integer grid. 



The PD3IS problem is the following: Given ( H = (Vjj, E h), kind), where H is a 
planar graph with maximum degree 3 and kind is a non-negative integer, does 
there exist an independent set / with |I|=k/jvd? 

Lemma 4. PD3IS is NP-hard. 

Proof Sketch: Due to space constraints we leave the proof of this lemma to the 

full version of the paper [8] . □ 

Theorem 4. Min2CD is NP-hard. 

Proof Sketch: Let ( H = (Vh, Eh), kind) be an instance of PD3IS. The reduc- 

tion produces a Min2CD instance (G, kmin), with kmin = nn — kind- Since H 
has maximum degree 3, we can find an orthogonal drawing of H , such that each 
vertex is on the integer grid of size quadratic in the size of H [22], see Fig. 5(b). 
The reduction scales the grid of H by a factor of nn — kind + 1 and replaces the 
vertices of Ed with vertex- circles, circles of diameter (u/f — k/atd + 1) units, see the 
large circles in Fig. 5(a). Each edge of H is represented with nn — kind + 1 links 
of chain-circles, circles having half a unit diameter connected to a vertex-circle 
at its head. Each vertex-circle also has a tail circle, connected to it at its tail in 
such a way that the diameter of the tail-circle crosses the tail of the vertex-circle. 
Since the given graph iJ is planar, we can obtain such a grid drawing of circles 
without causing any intersection between the chain-circles. 

We claim that H has an independent set of size kind if and only if G can 
be drawn using half-circles with at most nn — kind crossings. Assume H has 
an independent set I, where | I \=kind- Then there are kind vertices in iJ 
that are pairwise disconnected, which further implies that in G there are kind 
vertex-circles which are not connected by links. Then a feasible assignment of 
half-circles which allows a drawing of G with at most nn — kind crossings 
follows easily: For each vertex-circle, if the vertex corresponding to it is in I, 
then assign the head of the vertex-circle, otherwise assign the tail as chosen. 
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This results in an assignment that will have at least kind heads, and at most 
uh — kind tails. The chain-circles of the links connected to a vertex-circle which 
is already assigned to its head are assigned so that no crossing is created, i.e., 
starting from the chain-circle attached to the already assigned head, choose the 
half-circle that does not create any crossings. The chain-circles of the other links 
are assigned edges in a very similar way, but this time with no condition on the 
assignment of the first chain-circle. Finally the tail-circles are assigned randomly 
to the half-circles. Such an assignment assigns half-circles for every circle in the 
drawing, and creates no more than nn — kind crossings. The only crossings 
created are those between the tail-circles and the vertex-circles assigned to their 
tails. We already know that there are at most nn — kind such vertex tails, which 
implies the first direction of the claim. 

For the other direction, assume G has an assignment of half-circles with at 
most nn — kind crossings. Let Cn (Ct) be the sets of vertex-circles having 
their heads (respectively tails) chosen in this assignment We know that | Ct |< 
nn — kind , since otherwise the assignment would create more than nn — kind 
crossings. This implies that | Cn |> kind , since | Ct \ + | Cn |= nn- For 
any ( Ci,Cj ) pair, where Ci, Cj € Cn-, there cannot be any links between c, and Cj 
because if c % . Cj were linked together, each of the nn — kind + 1 links would have 
at least one crossing, creating more than nn — kind crossings which would be 
a contradiction. Let I be the set of vertices corresponding to the vertex-circles 
in Cn', then I is an independent set with size at least kind- □ 



Theorem 5. Max2CD is NP-hard. 

Proof Sketch: The reduction is again from PD3IS. Let ( H = (Vn, Eh), kind) 

be an instance of PD3IS. The reduction produces a Max2CD instance 
(G, km ax), with kmax = tn — nn + kind, where tn is the total number 
of circles in G. The proof proceeds along the lines of the proof of Theorem 4 
with a slight modification: in this case we add nn — kind + 1 tail-circles to each 
vertex-circle, rather than just one tail-circle. Then H has an independent set of 
size kind if and only if G can be drawn without any crossings such that at least 
kmax = tn — nn + kind edges have been assigned to some half-circle. □ 



6 Open Problems 



We conclude with two open problems: (1) Can Min2CD and Max2CD be ap- 
proximated within a constant factor? (2) Can we draw graphs with pre-specified 
vertex positions, using elliptic or other parabolic curve segments without creat- 
ing too many crossings? 
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Abstract. Trees are usually drawn using planar straight-line drawings. 
[1] presented an algorithm for constructing a planar straight-line grid 
drawing of an n-node binary tree with area O(n) and any pre-specified 
aspect ratio in the range [n - “,n“], where 0 < a < 1 is any constant, 
in 0(n log n) time. Unfortunately, the algorithm of [1] is not suitable for 
practical use. The main problem is that the constant hidden in the “Oh” 
notation for area is quite large (e.g., it can be as large as 3900). 

In this paper, we have made several practical improvements to the al- 
gorithm, which make it suitable for practical use. We have also con- 
ducted experiments on this newer version of the algorithm for randomly- 
generated and complete binary trees with up to 50, 000, and 65, 535 
nodes, respectively. Our experiments show that it constructs area- 
efficient drawings in practice, with area at most 10 times and 8 times 
the number of nodes for randomly-generated and complete binary trees, 
respectively. 



1 Introduction 

A drawing A of a tree is a straight-line drawing, if each edge is drawn as a single 
line-segment. A is a grid drawing if all the nodes have integer coordinates, T is 
a planar drawing, if edges do not intersect each other. Here, we concentrate on 
grid drawings. So, we will assume that the plane is covered by a rectangular grid. 
Let A be a grid drawing. Let R be the smallest rectangle with sides parallel to 
the X- and F-axes, respectively, that covers r completely. The width ( height ) of 
r is equal to 1+ width of R (1+height of R). The area of r is equal to (1+width 
of A) -(1+height of R) 1 which is equal to the number of grid points contained 
within R. The aspect ratio of T is the ratio of its width and height. 
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2 Our Result 

Planar straight-line drawings are more esthetically pleasing than non-planar 
polyline drawings. Grid drawings guarantee at least unit distance between the 
nodes, and the integer coordinates of the nodes allow the drawings to be dis- 
played in a display surface, such as a computer screen, without any distortions 
due to rounding-off errors. Giving users control over the aspect ratio of a draw- 
ing allows them to display the drawing in different kinds of display surfaces with 
different aspect ratios. Finally, it is important to minimize the area of a drawing, 
so that the users can display it in small display surfaces also. 

[1] presented an algorithm for constructing a planar straight-line grid drawing 
of an n-node binary tree T with area O(n) and with any pre-specified aspect ratio 
A in the range [n _ “,n“], where 0 < a < 1 is any constant, in 0{n log n) time 
(the algorithm actually takes three input parameters: T, A, and a user-defined 
constant e, such that 0 < e < 1 and n~ e < A < rf). 

While the algorithm of [1] was significant from a theoretical point of view, it 
suffered from the following drawbacks, that made it unsuitable for practical use: 

— The constant c hidden in the “Oh” notation for area can be quite large (e.g., 
it can be as large as 3900 for e = 0.6, and A = 1). One might argue that c 
is really the worst-case bound, and the algorithm might perform better in 
practice. However, the problem is that given a tree T with n nodes, and two 
numbers e and A as input, the algorithm will always pre-allocate a rectangle 
R with size exactly equal to cn, and draw T within R. Thus, the area of R is 
always equal to the worst-case area, and correspondingly, the drawing also 
has a large area. This is the major drawback of this algorithm. 

— Also, it uses another algorithm, called Algorithm u* -HV-Draw, as a subrou- 
tine. This increases the complexity of implementing the algorithm. 

In this paper, we have made several practical improvements to the algorithm, 
which make it more suitable for practical use: (Note that the area of the drawing 
constructed by this newer version of the algorithm is still 0(n )) 

— We have developed a newer version of the algorithm that does not require 
the pre-assignment of a rectangle with the worst-case area to draw a tree. 
Instead, it only pre-assigns an aspect ratio to the tree, which is used to draw 
the tree recursively in a bottom-up fashion. This makes it possible for the 
algorithm to construct a more area-efficient drawing in practice. 

— This newer version does not require Algorithm u* -HV-Draw as a subroutine, 
which makes it easier to implement. 

— The proof for the area being 0(n) given in [1] is based on a theorem by 
Valiant (Theorem 6 of [3]). Unfortunately, the most natural way of using the 
theorem seemed to be requiring the pre-assignment of a rectangle (with a 
large area). Hence, developing an algorithm that does not pre-assign a rect- 
angle required developing a new proof. Correspondingly, we have developed 
a new proof that does not use the theorem. Instead, it simply uses Induction. 
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— We have also implemented this newer version, and experimentally evaluated 
its performance for randomly-generated binary trees with up to 50,000 nodes, 
and for complete binary trees with up to 65,535 = 2 16 — 1 nodes. Our 
experiments show that it constructs area-efficient drawings in practice, with 
area at most 8 times the number of nodes for complete binary trees, and at 
most 10 times the number of nodes for randomly-generated binary trees. 

Due to space limitations, in this paper, we only present a brief overview of 
this newer version of the algorithm. Full details are given in [2]. 

3 Preliminaries 

Let T be an n-node binary tree, with one distinguished node v, which has at 
most one child, v is called the link node of T. T is an ordered tree if the children 
of each node are assigned a left-to-right order. A partial tree of T is a connected 
subgraph of T . If T is an ordered tree, then the leftmost pathpoi T is the maximal 
path consisting of nodes that are the left children of their parents, except the 
first one, which is the root of T; the last node of p is called the leftmost node of 
T . T is an empty tree, if it has zero nodes in it. 

Let r be a planar straight-line grid drawing of T. T has a good aspect ratio, 
if its aspect ratio is in the range [n~ a ,n a ], where 0 < a < 1 is a constant. Let r 
be the root of T. Let u* be the link node of T. T is a feasible drawing of T, if it 
has the following three properties: 

— Property 1: The root r is placed at the top-left corner of T. 

— Property 2: If u* ^ r, then u* is placed at the bottom boundary of T. 
Moreover, we can move u* downwards in its vertical channel by any distance 
without causing any edge-crossings in T . 

— Property 3: If u* = r, then no other node or edge of T is placed on, or 
crosses the vertical and horizontal channels occupied by r. 



Theorem 1 (Separator Theorem [3]). Every n-node binary tree T contains 
an edge e, called a separator edge, such that removing e from T splits T into two 
trees with at most (2/3 )n nodes each. Moreover, e can be found in 0(n) time. 

4 Our Tree Drawing Algorithm 

Let T be a ?r-node binary tree with a link node u* . Let A and e be numbers such 
that 0 < e < 1, and n~ e < A < n e . A is called the desirable aspect ratio for T. 

Our tree drawing algorithm, called DrawTree, takes e, A, and T as inputs, 
and uses a simple divide-and-conquer strategy to recursively construct a feasible 
drawing T of T, by performing the following actions at each recursive step: 

— Split Tree : Convert T into an ordered tree, in which u* is the leftmost node. 
Using Theorem 1, find a separator edge e = (u,v). Using e, split T into at 
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most five partial trees by removing at most two nodes and their incident 
edges from it. We get two cases: in Case 1 (see Figure 1(a)), e is not in 
the leftmost path of T, and in Case 2 (see Figure 1(b)), e is in the leftmost 
path of T. The partial trees obtained in Case 1 are Ta, Tp, T\, T 2 , Tc, and 
in Case 2 are Ta,Tb,Tq . The nodes removed are a and u in Case 1, and 
u in Case 2. Based on which partial trees are empty, we get 7 subcases for 
Case 1 and 8 subcases for Case 2. These subcases are described in detail in [2] 
(Figures 1(a), and 1(b) show the subcases for Case 1 and 2, respectively, in 
which all the partial trees are non-empty). 

— Assign Aspect Ratios : Correspondingly, assign a desirable aspect ratio A k to 
each partial tree T Let rik be number of nodes in Tk- Tk is a large partial tree 
of T if either A > 1 and nk > (n J M) 1 /( 1+e ), or A < 1 and nk > (An) 1 ^ 1+e ^; 
Tfc is a small partial tree otherwise. 

The assignment of Ak to Tk is done as follows: Let Xk = rik/n. 

• If A > 1: If Tk is a large partial tree of T, then Ak = XkA , otherwise 
(i.e. , if Tk is a small partial tree of T) Ak = i\ e . 

• If A < 1: If Tk is a large partial tree of T, then Ak = A/xk , otherwise 
(i.e., if Tk is a small partial tree of T) Ak = n%. 

Moreover, if A > 1, and Tk = Ta or Tk = Tp, then the value of Ak is 
changed to 1/Ak ■ This is done so because later in Step Compose Drawings , 
when A > 1, Ta and Tp are rotated by 90°, when constructing T. 
Intuitively, the above assignment strategy ensures that each partial tree gets 
a good desirable aspect ratio. 

— Draw Partial Trees : Recursively construct a feasible drawing Tk of each par- 
tial tree Tk with Ak as its desirable aspect ratio. 

— Compose Drawings : Arrange the drawings of the partial trees, and draw the 
nodes and edges, that were removed from T to split it, to obtain a feasible 
drawing T of T . If A < 1, then the drawings of the partial trees are stacked 
one above the other, and if A > 1, they are placed side-by-side. Also note 
that when A > 1, the drawing of Ta and Tp are rotated clockwise by 90° and 
flipped left-to-right before placing in T. Figure 1(c) and 1(e) (Figures 1(d) 
and 1(f)) show the arrangement, when A < 1, and A > 1, respectively, for 
the case shown in Figure 1(a) (Figure 1(b)). For the arrangement in all the 
subcases of Case 1 and 2, see [2]. 

Figure 2 shows a drawing of the complete binary tree with 63 nodes con- 
structed by Algorithm Draw Tree, with A = 1 and e = 0.2. 

Theorem 2. Let T be a binary tree with n nodes. Given any number A, where 
n~ a < A < n a , for some constant a, where 0 < a < 1, we can construct in 
0{n log n) time , a planar straight-line grid drawing of T with 0{n) area , and 
aspect ratio A, using Algorithm DrawTree. 

Proof. Omitted here due to lack of space. See [2] for the proof. 
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(e) (f) 




Fig. 1. General structure of tree T in (a) Case 1, and (b) Case 2, respectively; Drawing 
T, when A < 1, in (c) Case 1, and (d) Case 2, respectively, and when A > 1, in (e) 
Case 1, and (f) Case 2, respectively. 




Fig. 2. Drawing of the complete binary tree with 63 nodes constructed by Algorithm 
Draw Tree, with A = 1 and e = 0.2. 



5 Experimental Results 

We have implemented the algorithm using about 2100 lines of C++ code, and 
evaluated its performance on two types of binary trees, namely, randomly- 
generated, with up to 50,000 nodes, and complete, with up to 65,535 nodes. 

Recall that the algorithm takes three values as input: a binary tree T with 
n nodes, a number e, where 0 < e < 1, and a number A in the range [n~ £ ,n £ ]. 

The performance criteria we have used to evaluate the algorithm is the ratio 
c of the area of the drawing constructed of T, and n. 

To evaluate the algorithm, we varied n up to 50, 000 for randomly-generated, 
and up to 65, 535 = 2 16 — 1 for complete binary trees. For each n, we used five 
different values for e, namely, 0.1, 0.25, 0.5, 0.75, and 0.9. For each (n, e) pair, 
we used 20 different values of A uniformly distributed in the range [l,n e ]. The 
performance of the algorithm is symmetrical for A < 1 and A > 1. Hence, we 
varied A only from 1 through n £ , not from n~ £ through n £ . Hence, in the rest 
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c = area/n 
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(a) (b) 

Fig. 3. (a) Performance of the algorithm, as given by the value of c, for drawing a 
randomly-generated binary tree T with e = 0.5, and different values of A, where c=area 
of drawing/number of nodes n in T. (b) Projection of the 3D plot of (a) on XZ- plane. 



of the section, we will assume that A > 1. For each type of tree (randomly- 
generated and complete), and for each triplet (n, A, e), we generated three trees 
of that type. We constructed a drawing of each tree using the algorithm, and 
computed the value of c. Next, we averaged the values of c obtained for the three 
trees to get a single value for each triplet (n, A, e) for each tree-type. 

Our experiments show that the value of c is generally small, and is at most 
10 for randomly-generated, and at most 8 for complete trees. Figure 3 shows 
how c varies with n, and A, for e = 0.5 for randomly-generated trees. (See [2], 
to see how c varies with n, and A, for all the five values of e for both randomly- 
generated and complete binary trees.) 

We also discovered that c increases with A for a given n and e. However, the 
rate of increase is very small. Consequently, for a given n and e, the range for c 
over all the values of A is small (see Figure 3). E.g., for n = 10, 000, and e = 0.5, 
for randomly-generated trees, the range for c is [4.2, 5.2]. 

Similarly, for a given n and A, c increases with e. 

Finally, we would like to comment that the aspect ratio A act of the drawing 
r constructed by the algorithm is, in general, different from the input aspect 
ratio A. We computed the ratio r of A act and A. We discovered that r is close 
to 1 for A = 1, generally decreases as we increase A, and can get as low as 0.1 
for A = n e . However, we also discovered that for a large range of values for A , 
namely, [1, min {n e , n/ log 2 n}], r stays within the range [0.8, 1.5], and so is close 
to 1. Hence, if we need to construct a drawing with aspect ratio exactly equal 
to A, we can do so by adding enough “white space” to r. This will increase the 
area of T by a factor of at most max{l/0.8, 1.5} = 1.5 (assuming that A is in 
the above-mentioned range). Hence, the area of r will still be small. 
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Abstract. The problem of exact symmetry detection in general graphs 
has received much attention recently. In spite of its NP-hardness, two 
different algorithms have been presented that in general can solve this 
problem quickly in practice [5,2], However, as most graphs do not admit 
any exact symmetry at all, the much harder problem of fuzzy symmetry 
detection arises: a minimal number of certain modifications of the graph 
should be allowed in order to make it symmetric. We present a general 
approach to this problem: we allow arbitrary edge deletions and edge 
creations; every single modification can be given an individual weight. 
We apply integer programming techniques to solve this problem exactly 
or heuristically and give runtime results for a first implementation. 



1 Introduction 

An important aim in Automatic Graph Drawing is the display of symmetric 
structure inherent in a graph. Empirical studies show that symmetric drawings 
of a graph are much easier to understand than asymmetric ones [14]. However, 
two different problems occur when trying to develop algorithms for symmetric 
graph drawing: first, the symmetry detection problem is NP-hard for general 
graphs [13]. Second, most graphs do not admit any exact symmetry at all; in 
order to create nearly symmetric drawings, one has to define some kind of re- 
laxation of exact symmetry. 

For the first problem, two different algorithms with exponential runtime in 
general but fast runtime in practice have been proposed recently. In [4], we 
presented an approach based on integer programming techniques; see [5] and [3] 
also. Later, Abelson et al. [2] devised an algorithm based on group-theoretic 
methods, running even faster. 

The second problem is motivated by the observation that nearly symmetric 
drawings still help the user to understand the structure of a graph; see Fig. 1 for 
some examples. Unfortunately, mostly negative results about this problem have 
been published so far. In fact, detecting fuzzy symmetries is much harder than 
detecting exact symmetries in general, as shown by Chen et al. [6]. In their ap- 
proach, different types of modifications of the graph are allowed in order to make 
it symmetric, namely, to delete nodes, to delete edges, and to contract edges. 
They consider the problem of finding a minimum number of such operations 
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needed to obtain a graph admitting an axial or rotational symmetry. Among 
other results, they show that some of the corresponding problems are NP-hard 
even for trees. On the other hand, exact planar symmetry detection is possible 
in linear time for planar graphs [9,10,11,12]. 




In this paper, we follow a similar approach: we allow a minimum number 
of edge deletions and edge creations in order to obtain a graph that admits a 
symmetric drawing. A more precise description of this problem, along with a 
generalization, is given in Sect. 3. Before, we gather some definitions concerning 
symmetries in Sect. 2. In Sect. 4, we present integer linear programs (ILPs) 
modeling the fuzzy rotation and fuzzy reflection problems. The corresponding 
polytopes are investigated in Sect. 5. Finally, we roughly explain the framework 
of a branch & cut-algorithm for fuzzy symmetry detection in Sect. 6. There we 
also report runtime results for a first implementation of this algorithm. For more 
details or any proof, the reader is referred to [3]. 

2 Preliminaries 

For ease of exposition, we only consider simple undirected graphs in the following. 
An automorphism of a graph G = (V, E) is a permutation 7r of V with (*, j) £ E 
if and only if (7r(i), 7r(j)) £ E for all i,j £ V. The set of automorphisms of G 
forms a group with respect to composition, denoted by Aut(G). The order of an 
automorphism 7r is ord(7r) = min{/c £ IN | 7r fc = idy}, where idy denotes the 
identity permutation of V. For a node * £ V, the set orb^i) = {tt k [i) \ k £ IN} is 
the 7 T-orbit of i. Finally, the fixed nodes are those in Fix(7r) = {* £ V \ ir(i) = i}. 

A reflection of G is an automorphism n £ Aut(G) with n 2 = idy, i.e., 
an automorphism of order one or two. For k £ {1, . . . , n}, a k-rotation of G 
is an automorphism 7r £ Aut(G) such that |orb 7r (i)| £ (1, A:} for all i £ V 
and |Fix(7r)| < 1 if k 1. Each 2-rotation is a reflection, but not vice versa; the 
identity idy is both a reflection and a 1-rotation. 

Now assume that there exist an injective node placement p: V — > 1R 2 and an 
isometry ip: 1R 2 — > ]R 2 of the Euclidean plane with the following properties: for 
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each i £ V there exists a node i' £ V with = p(i'), and for i,j £ V we 

have (i,j) £ E if and only if ( i',j' ) £ E; for straight-line drawings of simple 
graphs, this means nodes are mapped to nodes and edges are mapped to edges. 
Then p and ip induce an automorphism n of G by setting n (i) = i' . Any auto- 
morphism induced like this is called a geometric automorphism or a symmetry 
of G. An automorphism of a graph G is a symmetry if and only if it is a rotation 
or a reflection [8] . Obviously, the symmetries displayed by a single placement p 
form a subgroup of Aut(G), but the set of all symmetries of G is not closed un- 
der composition. Observe that this definition of a symmetry, following the one 
given in [8], allows degenerate drawings: nodes may be located on non-incident 
edges and edges may overlap. In fact, in the case of a reflection, the subgraph 
induced by the fixed nodes may be arbitrarily large, so that degeneracies may 
be unavoidable. However, all nodes must have different positions. 



3 The Fuzzy Symmetry Detection Problem 

The general idea of our approach to fuzzy symmetry detection is to allow to 
change the adjacency of every pair of nodes in G. More precisely, we allow 
to create an edge between each pair of non-adjacent nodes and to delete the 
edge between each pair of adjacent nodes. Creating or deleting an edge between 
two nodes i and j of G involves a penalty of w t j £ 1R. The aim is to find a 
graph G' = (V, E') such that G' is symmetric and such that the total penalty for 
transforming G into G' is minimal. In other words, let EAE' = (E\E')U(E'\E) 
denote the symmetric difference between E and E' . Then our objective is to 
minimize the total penalty 

Y Wi i 

(iJ)EEAE' 

over all graphs G’ = (V,E') that admit any non-trivial symmetry. We call this 
problem the weighted fuzzy symmetry detection problem. 

Observe that we do not require positive weights. However, allowing negative 
weights does not increase the power of the approach, since all negative weights 
can be replaced by their absolute values after changing the adjacency of the 
corresponding edges in G. The presence of zero weights means that the adjacency 
of the corresponding node pairs does not matter at all. By using large enough 
weights, we can also forbid certain modifications absolutely. 

The weighted fuzzy symmetry detection problem is NP-hard, even if all 
weights are one. This follows from the NP-hardness of the exact symmetry detec- 
tion problem. In practice, fuzzy symmetry detection is much harder than exact 
symmetry detection — at least for our solution methods. 

Finally notice that we could also allow node deletions in our approach. For 
simplicity, however, we restrict ourselves to edge modifications here. 
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4 An Integer Linear Programming Model 

In order to describe fuzzy symmetries by integer linear programs, we split up 
the problem in the following way: we fix an integer k G {2, ... ,n} and consider 
the problem of finding a graph G' that admits a symmetry of order k and that 
differs minimally from G. By applying this approach to all feasible orders one 
after another, we can solve the original problem. 

Hence we set up a separate fuzzy rotation ILP for each k > 3 such that k 
divides either n or n — 1 and a fuzzy reflection ILP for k = 2. All these pro- 
grams are based on the fuzzy automorphism ILP that uses two different types of 
variables to model fuzzy automorphisms. 

The mapping variables are used to specify a permutation n of V: we define 
a real n x n-matrix M (n) by 

„ , f 1 if n(i) = ? 

“ { 0 otherwise, 

yielding a monomorphism M of the group S n of permutations of V into the 
general linear group GL„(1R). The matrices in M(S n ) are called permutation 

matrices and can be characterized as the set of n x n-matrices X = (xij) 

with Xij G {0,1} and 

y, Xij = 1 for all i G V and y Xjj = 1 for all j G V . (1) 

jev iev 

In other words, for the desired ILP, we can use a binary mapping variable Xij for 
each pair (i, j) G V 2 and add the constraints (1). A value of one for the mapping 
variable Xij is interpreted as mapping node i to node j. 

For technical reasons, we replace the permutation constraints (1) by the 
weaker constraints 

x^ < 1 for all i G V and y Xjj < 1 for all j G V . (2) 

j&V iGV 

Thus the permutation corresponding to the mapping variables may be partially 
or totally undefined. However, by adding a large value to all objective function 
coefficients of mapping variables, every optimal solution will still correspond to 
a well-defined rotation or reflection. 

The second type of variables is used to model all graphs G' with node set V. 
We use a binary edge variable yij for each pair of nodes i,j G V that is set to 
one if and only if the nodes i and j are adjacent in G' . 

The connection between the two types of variables is established as follows: 
we want the permutation 7r of V given by the mapping variables to be an au- 
tomorphism of the graph G' given by the edge variables. In other words, if two 
nodes i and j are adjacent in G' , then the same must be true for 7 r(z) and 7 r(j), 
and vice versa. This can be translated into 



Xiiji + Xi 2 j 2 < 2 ± y ili2 =F Vj 1 j 2 for all ii,h,ji,j 2 £ V . 



(3) 
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Indeed, these constraints are trivial if the left hand side is at most one. If the 
left hand side is two, then 7r(zi) = ji and 7r (z 2 ) = j 2 . In this case, the right hand 
sides make sure that = Vj 1 j 2 > be., either both pairs (zi,z 2 ) and (j 1; j 2 ) are 
adjacent or both are non-adjacent in G' . 

In summary, fuzzy automorphisms correspond to the solutions of 





x ij 


G 


{0,1} 


for 


all 


Vij = 


Vji 


G 


{0,1} 


for 


all 




Xij 


< 


1 


for 


all 


Siev 


x ij 


< 


1 


for 


all 


%iiji % 


*2J2 


< 


“ i Viii 2 “F Vjij2 


for 


all 



i,j £ V 
i,j € V, i^j 

ieV (4) 

j e V 

h, h, 31,32 e V, i 2 , ji . 



Next, we are going to restrict the ILP (4) to rotations of order k > 3. The 
first condition for n to be a fc-rotation is |orb„-(z)| G {1, k} for all i £ V. All other 
orbit lengths are forbidden. In order to set up linear constraints to enforce this 
property of w, we define a forbidden orbit of G as a circular list C = (zi, . . . , i p ) 
of pairwise distinct nodes in V with k ^ p >2. Let \C\ = p and 



v - 1 

z(G) = y ' x hh+l A x i p ii ■ 

t = 1 



Then the condition |orb ff (z)| G {1, A:} for all i G V is equivalent to 



x(C) < |C| — 1 for all forbidden orbits C . (5) 



Indeed, the constraint (5) implies that for every forbidden orbit C at least one 
variable in x(C) is zero, so that C is not an orbit of tt. To this point, we have 
ensured that all orbits of tt have length k or 1. Any fc-rotation has to meet the 
additional condition |Fix(7r)| < 1. We can express this by 



5 >«< 1 , ( 6 ) 

i&V 

as the sum on the left hand side equals the number of fixed nodes. Combining 
the constraints (4), (5), and (6), an ILP describing fuzzy fc-rotations of G is 



Xij 


G 


{0,1} 


for 


all 


II 

;<>s 


G 


{0,1} 


for 


all 


X) jev x ij 


< 


1 


for 


all 


Xiey x ij 


< 


1 


for 


all 


x iiji “F x i 2 j2 


< 


“ ± T Vjiji 


for 


all 


x(C) 


< 


\C\-1 


for 


all 


Xigy x a 


< 


1 . 







i,j G V 
i,j S V, j 
i G V 

J e V (7) 

h,i2,ji,j2 e V, h ^z 2 , ji ± j 2 
forbidden orbits C 



Fuzzy reflections of G are easier to model than fc-rotations: instead of using 
the forbidden orbit constraints, we just have to ensure that the automorphism 7r 
to be represented satisfies rr = idy, i.e. , that in our model we have 



Xij — 



Xji for all i,j G V . 



( 8 ) 
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In other words, we only need a single mapping variable for each pair of nodes. 
Hence an ILP modeling fuzzy reflections is given by 



Xij — 


Xji 


€ 


{0,1} 


for 


all 


i,j e V 


yij = 


Vji 


e 


{0,1} 


for 


all 


i,j e V, j 


Yjj^v 


Xij 


< 


1 


for 


all 


i e V 


Siev 


x ij 


< 


1 


for 


all 


j&v 




i2]2 


< 


1 T Viii 2 “F Vjij 2 


for 


all 


*1, *2, Jl, J2 G V, il¥= «2, j 1 



(9) 



Observe that both (7) and (9) do not depend on the structure of G; they 
are determined by the number of nodes of G. The information about edges 
of G is stored in the objective function: we minimize the total weight of the 
modifications that are necessary to get from G to G' , i.e. , we minimize 



Y wijVij+ Y "oO- ?/,,). 

(iJ)GV 2 \E ( i,j)eE 



Independently, we can assign arbitrary weights to the mapping variables. In the 
case k = 2, fixed nodes should be punished, since otherwise the graph G together 
with idy forms an optimal solution of (9). However, the penalty for fixed nodes 
has to be attuned to the penalties for edge deletion or creation in this case. 



5 The Fuzzy Symmetry Polytopes 

The ILPs (7) and (9) only depend on the number n of nodes and the desired 
order fc, but not on the structure of G. Thus we may define the fuzzy symmetry 
polytope FSP(fc,n) to be the polytope corresponding to (7), if k > 3, or the 
polytope corresponding to (9), if k = 2. More precisely, these polytopes are 
given as the convex hulls of the sets of solutions of these ILPs. 

Theorem 1. The polytope FSP(k,n) is full- dimensional. 

For designing the branch & cut-algorithm for fuzzy symmetry detection, it is 
important to investigate the polyhedral structure of FSP(/c,n). In particular, we 
have to find valid inequalities that induce facets of FSP(fc, n), i.e., maximal faces 
of this polytope. Before starting to present classes of such inequalities, observe 

Lemma 1. If a constraint H is valid or facet-inducing for FSP(k,n), then the 
same is true after replacing each variable yij in H by 1 — y^j . 

Lemma 2. If a constraint H is valid or facet-inducing for FSP(k,n), then the 
same is true after replacing each variable x^ in H by Xji. 

Both results follow from symmetry immediately. In the remainder of this sec- 
tion, we will usually consider only one of the four constraints without explicitly 
mentioning the other three. 

Theorem 2. The trivial constraints Xij > 0, yij > 0, and y^ < 1 induce facets 
of FSP(k,n). The same is true for the weak permutation constraints (2). 

To obtain further results about the polytope FSP(/c, n), we will treat rotations 
and reflections separately in the following subsections. 
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5.1 The Fuzzy Rotation Polytopes 

Throughout this subsection we assume k > 3. In this case, we have 

Theorem 3. The constraint (6) induces a facet of FSP(k,n). 

We continue with a review of the forbidden cycle constraints (5). For this, 
let C = i p ) be a circular list of pairwise distinct nodes with k ^ p > 2 

again. If k does not divide p , we can improve (5) to 

Y Xij<p-1 . (10) 

hfeC, i^j 

Indeed, the sum on the left hand side is at most p by (2) . If it equals p for some 
permutation 7 r, then C is fixed under 7r, but no single node of C is fixed. Since k 
does not divide p , some non-trivial orbit of 7r must have a length not equal to k, 
so 7r is no fc-rotation. 

By the same reasoning, if k does not divide p — 1 either, we even have the 
valid inequality 

Y x ij <p-i ■ (n) 

i,jec 

Observe that for k = n the constraint (11) is the subtour elimination constraint 
for the cut defined by C. 

Theorem 4. Let 2 < p < n — 1 and assume that k does not divide p. If k 
divides p— 1, then (10) induces a facet of FSP{k,n). Otherwise, (11) induces a 
facet of FSP(k , n) . 

Next, let Wi and Wi be any two subsets of V. Let ii ,*2 G V \ (W\ U W 2 ) 
with i\ ^ * 2 - Then we have the following valid inequality for FSP(fc,?r): 

'y ^ x iiji + y ^ x i2j2 ~ 2 _ yi^ + y ^ %i j2 • ( 12 ) 

heWi j 2 ew 2 hew lt j 2 <ew 2 

Indeed, the left hand side is at most two by (2), while the right hand side is at 
least one. If the left hand side equals two, we have X{ 1 j 1 = 1 for some j 1 g Wi 
and Xi 2 j 2 = 1 for some (2 € W%. Thus y, ll2 = 1 implies Uj 1 j 2 = 1, so that the 
right hand side is at least two. 

Theorem 5. The constraint (12) induces a facet of FSP(k,n) if and only if 

(a) \Wi\ 7 ^ 0 and \W 2 \ ± 0, 

(b) |W 2 \ Wi| ^ 1 and \W± \ W 2 I ^ 1, and 

(c) j W\ U W 2 1 > 2. 

In the special case W\ = W 2 = {ji, j' 2 }, the constraint (12) and its counterpart 
according to Lemma 1 emerge as 

x iiji + x iij 2 -b x i 2 j 1 “b x i232 — 2 i Vi\i 2 T Vhh' 

These constraints improve (3); both induce facets of FSP(fc,n) by Theorem 5. 
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Finally, let W C V and w € V \ W. Let i\ ,i 2 € V \ {W U {w}) with i\ ^ i 2 . 
Then 

^ ( ( x ilj T x i' 2 j ) 4“ ^Xiiin + ‘^ jX i2W — 3 Viiit 4“ ^ ( l/ju; (13) 

jew jew 

is a valid inequality for FSP(fc, n). Indeed, the left hand side is at most three 
by (2), while the right hand side is at least two. If the left hand side is equal to 
three, one of the nodes i\ or i 2 is mapped to w, while the other one is mapped 
to some j £ W. Hence 't/,; l7 ; 2 = yj w , so that the right hand side is at least three. 

Theorem 6. The constraint (13) induces a facet of FSP(k,n) if and only 
if\W\>2. 

5.2 The Fuzzy Reflection Polytope 

Next, we examine the polytope FSP(2,n) more closely. It is easy to see that for 
all odd subsets C of V the blossom constraint 

E < \C\ - 1 (14) 

ijec, ijtj 

is valid for the fuzzy reflection polytope. We have 

Theorem 7. Let C be an odd subset of V with |C| > 3. Then the blossom 
constraint (If) induces a facet of FSP(2 : n). 

The constraint (12) is valid for fuzzy reflections as well. However, we can 
improve it by adding x^i 2 to the left hand side: if Xq, 2 = 1, all other variables 
on the left hand side must be zero. Hence we get 

x iii 2 + 'y ' x iiji 4“ y 1 x iaja — 2 — Uiii 2 4~ ^ ' Vjija • (15) 

ji£Wi 32 6 W 2 ji€Wi, J 2 EW 2 



Theorem 8. The constraint (15) induces a facet of FSP(2,n) if and only if 

(a) |Wi| # 0 and \W 2 \ ^ 0, 

(b) \W 2 \ Wi\ ^ 1 and \W± \ W 2 \ ^ 1, and 

(c) |Wi UW 2 \ > 2. 

By Theorem 8, the constraint (3) is improved by the facet-inducing constraints 

x iiZ 2 4“ x iiji 4“ x iija 4” x iaii 4” x iaja — 2 i Uiii 2 4“ Vjij 2 • 

Next, observe that (13) is true for fuzzy reflections as well. Again, we can 
improve this constraint to 

^ x ii'i -2 4” y ( (, x ii j 4” x iaj) + 4“ — 3 Vi\i 2 4” y ( Ujw • (16) 

jew jew 

Theorem 9. The constraint (16) induces a facet of FSP(2,n) if and only 
if\W\>2. 
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6 The Branch &; Cut-Algorithm 

In the following, we roughly describe our branch & cut-algorithm based on the 
results of the previous sections. We restrict ourselves to separation and primal 
heuristics; for other topics including the branching strategy and rules for setting 
variables, see [3]. For branch & cut in general, see [4] or [3]. 

6.1 Separation 

We start the branch & cut-algorithm with the relaxation of (7) or (9) composed of 
all variables, the constraints (2), and, if k > 3, the constraint (6). The separation 
problem consists of finding inequalities that are valid for (7) or (9) but violated by 
the optimal solution of the current LP-relaxation. If we can find such inequalities, 
we add them to the relaxation and repeat the process. 

We first try to separate constraints of type (12), if k > 3, or (15), if k = 2. 
For this, we use a straightforward greedy heuristic: for fixed i\ and 12 , we add 
nodes to W\ or W% as long as the difference between the left and the right hand 
side of (12) or (15) can be increased (for the current LP-solution) . If we end up 
with any violated constraints, these are added to the LP-relaxation. 

Otherwise, we start separating constraints of type (13), if k > 3, or (16), 
if k = 2. It is easy to see that these constraints can be separated in polynomial 
time: for a fixed combination of i 1 , * 2 , and w , we add a node j to W if and only 
if the current LP- values of x u; - and Xi 2 j add up to more than the one of yj W . 

If we did not find any violated constraints of type (13) or (16), we separate the 
different kinds of orbit constraints: for k > 3, we search for violated constraints of 
type (10) or (11), which can be done in polynomial time by submodular function 
minimization. The same is true for the blossom constraints (14) for k = 2. If we 
still did not find any violated constraints, we start branching. 

6.2 Primal Heuristics 

Primal heuristics exploit the optimal solution of the current LP-relaxation in 
order to find feasible (but not necessarily optimal) solutions of the original ILP. 
In our algorithm, we use two heuristics working in an opposite way. 

The first one starts with determining a symmetry of the given order k of the 
complete graph K n that is close to the current (possibly fractional) values of the 
mapping variables. For example, we can traverse the mapping variables x-ij in 
descending order according to their value and let i be mapped to j if and only 
if no other node has been mapped to j before and i has not been mapped to 
any other node before. Once having fixed the node permutation, we can easily 
determine optimal values for all edge variables. 

The second heuristic first fixes the edges of G' by rounding the values of all 
edge variables. Then the mapping variables x i: j are again traversed in descending 
order according to their value; the node i is mapped to j if and only if this 
contradicts neither earlier mapping decisions nor the fixing of edges in G ’ . Even 
if this strategy only yields a partially defined permutation of V, we can still use 
its value as a bound on the optimal solution of the ILP. 
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6.3 Runtime Results 

Our branch & cut-algorithm for fuzzy symmetry detection is not fully developed 
yet. It is necessary and certainly possible to improve it significantly by future 
work. Nevertheless, we experimented with an implementation based on the poly- 
hedral results obtained so far. In this evaluation, we searched for fuzzy reflections 
and fuzzy rotations of order n. The penalty for every modification of the graph 
was set to one. In the reflection case, any fixed node also involved a penalty of 
one, i.e., one edge modification was allowed to prevent one fixed node. 

For all evaluations, we used an AMD 1400 MHz Athlon processor. Runtime 
results are given in CPU-seconds. Our implementation is based on ABACUS [1] 
in combination with CPLEX [7] . We used random simple undirected graphs for 
our experiments. Each pair of different nodes is adjacent by a probability of 1/4. 
Observe that random graphs are hard instances for fuzzy symmetry detection, 
since in general a lot of modifications are necessary to make them symmetric. 
For every number n of nodes, we tested 100 graphs in each evaluation. 

Compared with our results for exact symmetry detection presented in [5], 
runtimes for fuzzy symmetry detection are rather disappointing. The results 
for reflection detection are displayed in Table 1. We list average and maximal 
runtime, number of subproblems, and number of LPs. Since we can use our 
algorithm heuristically by stopping it at any time, we also list the CPU-time 
needed to find an optimal solution — without knowing its optimality at this point. 



Table 1 . Results for fuzzy reflection detection 



n 


runtime 
avg max 


opttime 
avg max 


#subprobs 
avg max 


#LPs 
avg max 


8 


0.04 


0.32 


0.00 


0 


4.0 


25 


13.2 


79 


9 


0.13 


1.58 


0.01 


1 


00 

bo 


57 


28.9 


201 


10 


0.53 


3.15 


0.04 


2 


20.7 


95 


68.8 


317 


11 


2.01 


14.46 


0.43 


5 


48.4 


263 


160.9 


894 


12 


11.92 


75.08 


1.86 


49 


167.5 


931 


567.8 


3185 



Table 1 shows that the runtime, the number of subproblems, and the number 
of LPs increase sharply already for small graphs. However, the time needed to 
find the optimal solution is much shorter than the total runtime, i.e., we know 
the optimal solution much earlier than the fact that it is optimal. We conclude 
that the primal heuristics used in our algorithm work well, whereas the cutting 
planes and separation algorithms must be improved in order to get a practically 
useful algorithm. In fact, a lot of violated cutting planes were usually found and 
added without changing the objective value of the optimal LP-solution. 

The results displayed in Table 1 are much more homogeneous than those for 
exact symmetry detection; see [5]. This is even more evident if we split up the 
results by the optimal objective function value, i.e., by the minimial number of 
modifications and fixed nodes; see Table 2 for n = 12. 
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Table 2. Results for fuzzy reflection detection, n = 12 



obj 


% of 
insts 


runtime 
avg max 


opttime 
avg max 


#subprobs 
avg max 


#LPs 
avg max 


1 


6 


0.17 


0.46 


0.00 


0 


9.0 


21 


26.2 


62 


2 


17 


1.18 


5.81 


0.59 


5 


30.5 


97 


99.4 


333 


3 


30 


4.12 


10.85 


1.03 


8 


75.0 


155 


249.0 


489 


4 


33 


13.44 


24.32 


1.33 


15 


192.6 


311 


660.5 


1031 


5 


13 


40.72 


60.22 


7.77 


49 


511.0 


791 


1729.1 


2526 


6 


1 


75.08 


75.08 


0.00 


0 


931.0 


931 


3185.0 


3185 



Table 3. Results for fuzzy n-rotation detection 



n 


runtime 
avg max 


opttime 
avg max 


#subprobs 
avg max 


#LPs 
avg max 


8 


0.10 


1.56 


0.00 


0 


12.1 


135 


28.6 


299 


9 


1.67 


45.07 


0.03 


2 


87.8 


1239 


232.6 


3775 


10 


6.93 


133.58 


0.46 


7 


165.7 


2251 


541.4 


8044 


11 


23.34 


414.23 


2.25 


56 


568.0 


9867 


1707.9 


29759 


12 


305.64 


15.0% 


35.60 


711 


2616.1 


16785 


8692.9 


56342 



Table 4. Results for fuzzy reflection detection, deleting edges only 



n 


runtime 
avg max 


opttime 
avg max 


#subprobs 
avg max 


#LPs 
avg max 


8 


0.02 


0.14 


0.00 


0 


3.1 


15 


9.0 


38 


9 


0.05 


0.24 


0.00 


0 


5.9 


29 


17.5 


59 


10 


0.10 


0.35 


0.00 


0 


8.6 


33 


24.0 


70 


11 


0.35 


1.53 


0.05 


1 


16.7 


53 


48.7 


156 


12 


0.84 


4.39 


0.06 


2 


28.8 


111 


85.0 


327 


13 


4.68 


20.13 


1.23 


11 


85.5 


273 


260.3 


846 


14 


8.19 


26.94 


1.22 


8 


122.7 


371 


380.1 


1112 


15 


51.86 


161.30 


11.95 


124 


405.5 


1149 


1280.5 


3623 


16 


79.16 


219.41 


10.05 


107 


554.9 


1419 


1791.2 


4521 



The results displayed in Table 2 reveal a strong connection between the 
hardness of an instance and its distance from being rcflectional symmetric. This 
is good news, as the fuzzy symmetry detection algorithm is designed to draw 
nearly symmetric graphs. 

Now consider the case k = n. As shown in Table 3, average runtimes for fuzzy 
rotation detection are even longer — by a factor of more than 25 for n = 12. For 15 
instances, we could not even find a provably optimal solution within one hour 
of CPU-time. One reason may be the larger number of mapping variables in the 
fuzzy rotation ILP compared with the fuzzy reflection ILP. 

To investigate the effect of the number of variables more closely, we finally 
evaluated a variant of the fuzzy reflection detection algorithm: we only allowed to 
delete edges but not to create them. In the corresponding ILP, we only needed an 
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edge variable for each adjacent node-pair in the original graph. By construction 
of our test set, we could thus save about 3/4 of the edge variables. Table 4 
shows that a smaller number of edge variables can decrease runtime significantly. 
Compared with the results given in Table 1, we observed an improvement of 
average runtime by a factor of more than 14 for n = 12. 

In summary, if used as an exact algorithm, our branch & cut-method only 
works for small graphs yet. Its performance is much better for nearly symmetric 
graphs than for random graphs; restricting the set of allowed modifications also 
helps. We are optimistic that the runtime can be improved significantly by future 
work. In fact, for exact symmetry detection, we were able to accelerate our 
branch & cut-algorithm by a factor of up to 50 until now, compared with [4]. 
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Abstract. We study barycentric placement of vertices in periodic 
graphs of dimension 2 or higher. Barycentric placements exist for ev- 
ery connected periodic graph, are unique up to affine transformations, 
and provide a versatile tool not only in drawing, but also in computation. 
Example applications include symmetric convex drawing in dimension 2 
as well as determining topological types of crystals and computing their 
ideal symmetry groups. 



1 Introduction 

Periodic graphs provide convenient models for studying the topology of the inter- 
atomic structure of crystal species. Usually, atoms are represented by vertices, 
while bonds are represented by edges. Some crystals, however, are made from 
larger building blocks using linkers, in which case it is often more useful to model 
this meta-structure [OEL+OO]. 

Apart from their relevance for chemistry, periodic graphs occur as the 1- 
skeleta of periodic tilings [DDH+99] and have applications in VLSI circuit design 
and integer programming [Orl84,CM91]. 

Although periodic graphs may be described in a finite way, algorithms for 
finite graphs do not usually carry over automatically. For example, precise practi- 
cal methods for determining isomorphism of periodic graphs have been missing 
up until now, although much searched for by chemists and crystallographers 
[Kle87]. 

Several authors [OB92,TRR + 97,DDH + 99], have constructed many examples 
of potential crystal structures by means of combinatorially enumerating periodic 
graphs. In addition to the isomorphism problem, this gives rise to a demand for 
chemically reasonable 3-dimensional drawing algorithms. 

Barycentric placements have been introduced by Tutte [Tut60,Tut63] for con- 
structing convex drawings of “almost” 3-connected planar graphs. Here, an outer 
face is chosen and realized as an arbitrary convex polygon. Tutte showed that 
for any such choice a barycentric placement of the remaining vertices uniquely 
exists. If, moreover, the only pairs of vertices at which the graph can be cut are 
situated on different sides of the outer polygon, it induces an embedding with 
convex inner faces and the maximal possible symmetry which is compatible with 
the outer polygon. 

We obtain a general existence and uniqueness result for connected, periodic 
graphs in general and a convexity result similar to Tutte’s for 3-connected, 2- 
periodic, planar graphs. It should be noted that barycentric drawings have been 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 178-189, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 
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employed by crystallographers for quite a while, but the author is not aware of 
any previous rigorous justification of that practice. 

We show further that for a class of graphs which we call stable, a number of 
algorithmic problems can be solved in polynomial time using exact barycentric 
drawings as a tool. 

In [DelOl], barycentric drawings for periodic graphs are treated mathemati- 
cally, with only a brief glance at computational issues. Here, we skip the more in- 
tricate mathematical details, while concentrating on the algorithmic and graph- 
ical aspects. The next section introduces periodic graphs. Section 3 defines 
barycentric drawings and gives some basic results. The two-dimensional case 
is treated in Section 4, and an algorithm for finding canonical forms and sym- 
metries in Section 5. Section 6, finally, discusses some applications. 



2 Periodic Graphs 

A d-periodic graph is a simple undirected graph on which the group Z d of d- 
dimensional integer vectors acts by automorphisms. The action is to be free, i.e., 
no vector may map any vertex onto itself, and there may be only finitely many 
orbits on the set of vertices as well as on the set of edges. Consequently, the 
set of vertices must be countable and the graph must be locally finite, i.e., only 
finitely many edges may be incident to any given vertex. 

We interpret the result of applying an integer vector to a vertex or edge as a 
formal translation and write v + s or e + s, respectively, where s = (si, . . . , Sd), 
and v + (s + t) = (v + s) + t and so on. 

An isomorphism between periodic graphs G = (V. E ) and G' = (V 7 , E') is an 
ordinary graph isomorphism, say /, which respects the Z d action. This means 
that for each vector s £ Z d , there is another vector, s', such that f(v + s) = 
/( v) + s' for any vertex v £ V. It is easy to see that the function /*: Z d Z d 
which maps every vector s to its corresponding s' must be linear and bijective, 
thus it can be represented by a d x d unimodular matrix (a matrix with integer 
entries and determinant 1 or —1). 

A natural way to encode periodic graphs is the so-called vector or (labelled) 
orbit graph method [CHK84]: let n be the number of vertex orbits and m the 
number of edge orbits under the Z d action. Choose arbitrary representatives 
v\, . . . ,v n of the vertex orbits. Now each vertex v £ V can be written uniquely 
as Vi + s for some i £ {1, . . . , n} and some s £ Z d . For every edge orbit, there is a 
unique representative e = (vi, Vj + t), where either i < j or else i = j and the first 
nonzero entry of t is positive. Specifying all these edge representatives completely 
characterizes a periodic graph. Consequently, a periodic graph of dimension d 
with m edge orbits is given as an array of m x (d + 2) numbers, where the first 
two entries in each row are indices of vertex representatives, while the remaining 
d entries form a vector, which we call the shift vector for this edge, and by which 
the second vertex is to be shifted to obtain the second end of the edge. 
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Here are a few tiny examples: 



1110 
110 1 



12 00 
and 12 10 
12 0 1 



encode the 1-skeleton of a face-to-face tiling by regular squares and the 1-skeleton 
of a face-to-face tiling by regular hexagons, respectively. 

Their 3-dimensional analogues are 



1110 0 
110 10 
110 0 1 



and 



1 2 0 0 0 

12 10 0 

12 0 10 

1 2 0 0 1 , 



the 1-skeleton of a face-to-face tiling by regular cubes and the atom-bond network 
of diamond. 

A vector representation is naturally interpreted as a finite directed multi- 
graph with edges labelled by shift vectors, where vertices, edges and their in- 
cidences represent vertex and edge orbits and their incidences in the original 
graph, hence the alternative term orbit graph. 

For each periodic graph, there are infinitely many choices of vertex represen- 
tatives. There are, moreover, infinitely many ways of choosing a basis for Z d , 
a choice which determines the actual coordinates used in specifying the shift 
vectors. An obvious improvement is to require that the set of representatives 
vi,...,v n , or rather the subgraph induced by them, be connected, if possible. 
This leaves, up to translations, only finitely many choices for the set of vertex 
representatives. It is easy to see that a periodic graph is connected if and only if 
it has a connected set of vertex representatives and for any such set the induced 
set of shift vectors spans Z d [CM91]. It should be possible to formulate addi- 
tional conditions on the set of shift vectors in order to arrive at a finite set of 
permissible representations. Note, however, that the set of shift vectors need not 
contain an integral basis for Z d . Even in the 1-dimensional case, the “vectors” 2 
and 3 form a counterexample. 

In the following, we will assume that all periodic graphs under consideration 
are connected. 



3 Barycentric Drawings and Isomorphisms 



It is reasonable to require that a drawing of a periodic graph shall display 
the periodicity. In the following, we will restrict our attention to straight 
line drawings. Consequently, a drawing is determined by a vertex placement 
p: {t>i, . . . , v n } — > and a linear, nonsingular function p*:Z d — > R d , where the 

extension of p to V is given by p(vi + s) := p(vi ) +p*(s). 

A barycentric drawing is one which satisfies 



p(v) 



1 

Mv)\ 



Y p(w), 

w£N(v) 
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where N(v) denotes the set of all vertices adjacent to v. This definition, quite 
obviously, is invariant under affine transformations, i.e. , if p* is changed or the 
whole graph is shifted by some vector, a barycentric drawing remains barycentric. 

If p* and the position of an arbitrary vertex are preassigned, say p* = Id^ 
and p{y i) = 0, there is a unique barycentric drawing satisfying these conditions. 
Consequently, barycentric drawings always exist for connected, periodic graphs 
and are unique up to affine transformations. 

To see why, consider the “energy function” 

E (P) ■= W’PW) 2 - 

vw(zEo 

where d denotes Euclidean distance and Eq is a set of representatives for the 
edge orbits. It is straightforward to see that p is barycentric if and only if p, 
taken as a point in R” d , is critical for E, i.e., all partial derivatives vanish. Now 
consider 

Ibll 2 = bM | 2 + ... + bK)l 2 - 

Because (V. E) is connected and p(v i) = 0 , the maximum edge length can be 
made arbitrarily large by just taking any p with ||p|| 2 large enough. Now E is a 
quadratic function, which by the above is non-degenerate in each direction, thus 
must have a unique critical point, a minimum (c.f. [RG96]). 

This does not only show the existence and uniqueness of barycentric draw- 
ings, but also points to an aesthetically interesting property, namely that 
barycentric drawings minimize the sum of squared edge lengths among all peri- 
odic drawings with a given p*. 

In the following, we will always use p to denote the barycentric vertex place- 
ment with p* = Idrf and p(v i) = 0. 

We call a graph stable if p is injective, thus vertices can be identified uniquely 
by their positions. This is a key to lowering the complexity of several compu- 
tational problems dealing with periodic graphs. Where traversal algorithms are 
employed, it is usually enough, though, to require local stability, meaning that 
p need only be injective on N(v) for each vertex v. 

For display purposes, p can be conveniently approximated by iterative, force 
directed methods (c.f [Ead84,FR91]), where all forces are attractive, only exist 
between neighbors and are proportional in size to the respective edge lengths. In 
order to identify vertices by their positions, however, the linear system of equa- 
tions induced by the barycentricity condition has to be solved exactly, i.e., using 
arbitrary precision rational arithmetic. This is necessary because in principle, 
vertices positions can get arbitrarily close without being identical. 

Next, we claim that an isomorphism between periodic graphs induces an 
affine mapping between their barycentric drawings. To see this, let / be an iso- 
morphism between graphs (V, E) and (V ' , E') with barycentric placements p and 
p', respectively. Define g*(s) := p , *(/*(p* -1 (s))) and consider new placements qi 
and q 2 on V, where <?i(u) := p'(f{v)) and q 2 (v) = g*(p(v))+p'(f(v 1 ))-g*(p(v i)). 
It is then an easy exercise to see that qi(vi) = q 2 (v\) = p'(f(v i)) and 
qi(v + s) — qi{v) = q 2 (v + s) — q 2 {v) = g*(p*(s)) for any s £ Z d . Because / is an 
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isomorphism and all the •* functions are linear, both q\ and (72 are barycentric 
drawings, so they must be identical. 

We state this as our first theorem: 

Theorem 1. If f is an isomorphism between connected periodic graphs (V,E) 
and (V',E') with barycentric vertex placements p and p' , respectively , then there 
is an affine function p such that p'(f(v)) = p(p(v)) for all v £ V . 

Theorem 1 has a nice consequence regarding automorphisms of periodic 
graphs: 

Theorem 2. If (V, E) is a connected periodic graph, there is a linear function 
p *: 7L d — > 7L d such that for any barycentric placement satisfying p(v + s ) = p(v) + 
p*(s) for all v £ V and all s € 7L d and any automorphism f of ( V,E ), there is 
an isometry p ofM. d with respect to the Euclidean metric, such that p(f(v)) = 
p(p(v)) for all v £ V . 

What this means, essentially, is that the full automorphism group of a peri- 
odic graph can be realized as a crystallographic space group. There can be only 
finitely many conjugacy classes of automorphisms with respect to the h d action. 
Turning a finite set of affine maps into isometries by redefining the metric, or 
equivalently, the basis with respect to which they act, is a standard trick in 
mathematical crystallography, see for example [Sch80]. 

For stable periodic graphs, every nontrivial automorphism must move some 
vertex, so the mapping from automorphisms to isometries is injective. Conse- 
quently: 

Corollary 1. The automorphism group of any stable, connected, periodic graph 
is isomorphic to a crystallographic space group. 

For unstable graphs, some nontrivial automorphism may be mapped to the 
identity. As an example, see Figure 1, where any barycentric placement must map 
pairs of adjacent white vertices to the same position, so that the automorphism 
that just flips each such pair induces the identity on K d . 

■OjO' 

<o|<l>- 

Fig. 1. An unstable 2-periodic graph. 
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4 Planar Graphs 

Tutte’s classical results on convex drawings of finite planar graphs [Tut60,Tut63] 
carry over to planar 2-periodic graphs. In particular, we have the following: 

Theorem 3. For a connected, 2-periodic, planar graph G, the following are 
equivalent: 

1. G is 3- connected . 

2. G has a convex embedding in the plane. 

3. G has a convex embedding in the plane such that every automorphism of G 
is realized by a crystallographic symmetry of the embedding. 

Here, as usual, an embedding of a graph is a drawing in which the vertices 
and interiors of edges are all pairwise disjoint. Whenever the context is clear, we 
do not distinguish in terminology between vertices or edges and their images in 
a drawing. 

A similar convexity criterion has been given by Thomassen [Tho80] for in- 
finite planar graphs in general, but does not imply anything with respect to 
symmetries. Obviously, (3) implies (2), and, employing an easy lemma stating 
that any finite vertex cut in a periodic graph has exactly one infinite component, 
it is straightforward to see that ( 2 ) implies ( 1 ). 

The implication (1) — > (3) follows from a result in [DelOl] (Theorem 20), 
namely: 

Theorem 4. The straight line drawing induced by any good, periodic vertex 
placement of a 2-periodic, 3- connected , planar graph is a convex embedding. 

Here, a vertex placement is called good if every vertex v is placed in the 
relative interior (c.f. [RG96]) of its neighbor’s positions, defined as 



The proof, which we are going to sketch here briefly, was largely inspired by 
Richter-Gebert’s treatment [RG96] of the classical Tutte Theorem. 

The first step is to show that the neighbors of a given vertex v can never be 
placed on a common straight line h. For, assume otherwise and let, without loss 
of generality, h be horizontal. Consider the largest connected subgraph S of G 
containing v with all vertices placed on h. Then S, by periodicity, can not be the 
whole of G. Because of the 3-connectedness, and using Menger’s theorem, there 
must be at least 3 vertices a, b and c in S with neighbors not on h. Because the 
placement is good, each of these has neighbors both above and below h. Using 
the periodicity again, one can show that those above h are pairwise connected 
by paths staying above h. Likewise, those below h are connected below h. But 
then we have 3 independent ways of interconnecting vertices a, b and c, one on, 
one above and one below h, which establishes a K 33 minor in G. 





w£N(v) 



w£N(v) 
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Consequently, in a good placement, every vertex falls inside the convex hull 
of its neighbors, which must be non-degenerate. The next step is to prove the 
theorem for triangulations without degenerate triangles. By summing up angles 
in two different ways, first around vertices and then inside triangles, and using 
an Euler characteristic argument, one shows that the triangles are all oriented 
consistently and that angles sum up to 27 r around each vertex. The simple- 
connectedness of the plane then implies that there can be no overlaps between 
triangles. 

Finally, the theorem is extended to the general case by first triangulating 
each facial cycle of the given graph without introducing new vertices. This can 
be done in such a way that the h d images of a facial cycle are triangulated 
consistently, making the resulting triangulated graph G' periodic again. If p is 
a good placement for G , it is also a good placement for G' . A perturbation 
argument is used to show that no degenerate triangles can occur. Consequently, 
p must induce a straight-line embedding of G' and thus of G. But because p is 
good, all interior facial angles occurring in the induced embedding of G are less 
than 7 r, so all the faces are convex. 

A nice feature of Theorem 4 is that it allows for some flexibility in the con- 
struction of drawings. Tutte’s barycentric construction results in aesthetically 
pleasing pictures, but is also well-known for allowing a rather large variation 
in edge lengths, making some graph structures hard to recognize. This can be 
remedied to some extend by using a force-directed drawing method with the 
barycentric placement as its initial value. Such methods can be adjusted to sup- 
port a more uniform distribution of edge lengths, for example by using attractive 
forces proportional to some power larger than 1 of the edge lengths. As long as 
the process is symmetric and carefully tuned as to keep the result of each itera- 
tion good, the final drawing will still be convex and display the full symmetry. 
The author has first employed this idea to polyhedra graphs, for which an ana- 
logue of Theorem 4 holds [RG96]. Figure 2 shows an example. 






Fig. 2. Convex drawings of a large polyhedron graph. Left: barycentric drawing. Middle 
and right: variants with attractive forces proportional to the square and fourth power 
of the edge length. 
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A more general approach of relaxing the barycentric drawing would define an 
appropriate penalty function and use optimization methods such as simulated 
annealing or downhill simplex to search for a minimum. In Figure 3 we show three 
drawings of the same periodic graph, all derived from variations and relaxations 
of the barycenter construction. 




Fig. 3. Three drawings of a periodic planar graph, inspired by a Moorish pattern. 
Left: barycentric drawing. Middle: relaxed barycentric drawing with equal edge lengths. 
Right: vertices on mirror planes were spaced evenly, then those inside the fundamental 
triangle were placed using a barycentric drawing of the second derived subdivision. 



5 Symmetries and Canonical Forms 

Isomorphism of two structures can be tested in linear time when these are given 
in a canonical form, i.e. , a representation that uniquely depends on the isomor- 
phism type. Therefore, an efficient canonical forms algorithm is highly desirable. 

A vector representation of a periodic graph as in Section 2 can be converted 
into a unique linear string of numbers by first sorting the rows lexicographically 
and then concatenating them into a single row. Our approach towards canonical 
forms is to efficiently construct a small characteristic set of vector representa- 
tions and pick a minimal one with respect to the lexicographic order of their 
corresponding strings. 

A straightforward way to obtain a finite set is by picking vertex represen- 
tatives as they occur in, say, a breadth first traversal of the graph. To make 
the resulting set characteristic, every possible order of picking vertices has to be 
executed, which may easily lead to an exponentially large set of representations. 

We use barycentric placements to arrive at a smaller set of traversals. Let p 
be a barycentric placement and recall that a graph is called locally stable if for 
each vertex v the function p is injective on the set N(v) of v's neighbors. Then 
a traversal must pick these in lexicographical order of their coordinates as given 
by p, with respect to some appropriate basis. Because p is unique up to affine 
transformations, there is a unique breadth first traversal of this type for every 
choice of start vertex and coordinate basis. 

It remains to pick a characteristic set of pairs, each consisting of a vertex 
and a basis. The term characteristic means that these must depend only on the 





186 O. Delgado-Friedrichs 



isomorphism type of G , not on its original representation. A natural choice is an 
ordered d-tuple of edge vectors. There are 2m of these in a graph with m edge 
orbits, thus (2 m) d potential bases, which can be readily determined from a given 
vector representation. Of course, only linearly independent tuples can be used, 
but by the d-periodicity and connectedness of G, at least one such exists. As a 
start vertex for the traversal, the first vertex of the first directed edge is used. 

If during the traversal, a vertex is encountered which belongs to a new orbit, 
it is made the representative of this orbit and an edge with trivial shift vector 
is added to the new representation. If the orbit has already been seen, an edge 
with a non-trivial shift vector is added. These shift vectors are expressed in the 
basis used for the traversal, which is determined by the original tuple of edge 
vectors and in general is not an integral basis of the translation lattice. Thus, in 
the end, a basis for the integral span of the shift vectors must be computed in 
a deterministic way and the shift vectors expressed in terms of that basis. The 
complete algorithm is shown as Algorithm 5 below. 

It is straightforward to extend the algorithm to computing the automor- 
phisms as well. Indeed, whenever two d-tuples of directed edges lead to the same 
representation, an automorphism of the periodic graph G has been found and 
all automorphisms must occur in this way. 

The runtime analysis of the Algorithm is complicated by the fact that exact 
rational numbers have to be used in all the calculations involving vertex posi- 
tions. These arise from the solution of a linear system of equations AX = B, 
where A is an n x n integer matrix, B is an n x d integer matrix and X is an n x d 
rational matrix. Here, the *-row of A corresponds, of course, to the barycenter 
condition for the i-th vertex representative. It is not hard to show that the log- 
arithm of the numerators and denominators of entries in X is 0(n log n) if the 
entries in A and B are assumed to be constant in size, say, representable by 
machine words. In practice, we may then also assume that n is representable by 
a machine word, so the above boils down to O(n). It turns out that the maximal 
representation size of any rational number occurring during the execution of the 
algorithm rises to O(dn). 

Ignoring the complexity of arithmetic for the moment, the main loop of Al- 
gorithm 5 runs in time 0(dm d+1 log D), where m, once again, is the number 
of edge representatives and D is the maximal degree of a vertex. Clearly, the 
main loop is executed (2m ) d times and the time spent per iteration is dominated 
by sorting the transformed shift vectors at each step of the traversal and accu- 
mulates to 0(dm log D). Everything else, including the preparations before the 
execution of the main loop, takes time 0{dm). 

It remains to consider the final steps after the main loop. In order to find a 
basis for the integral span of the shift vectors, we need to triangulate the matrix 
formed by these using column operations which only add integer multiples of 
columns to other columns. The problem here is that it may take several steps 
to eliminate a single entry, which may lead to an overall doubly exponential 
growth in some of the other entries. Clearly, by multiplying with the least com- 
mon denominator of all entries, we can reduce the original problem to that of 
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Algorithm 5. Canonical Form. 

Input 

— A list E of edge representatives of the form (v, w, s ) with v, w £ N and s £ Z dxl (a 
d-dimensional column vector), representing a connected, locally stable, d-periodic 
graph G. 

— A barycentric vertex placement p for G. 

Output 

— A list C of edge representatives, constituting a canonical form for G. 



Set A = [ ] . 
for (v, w, s) in E: 

Append (v,w, (p w + s) — p v ) and (w,v,p v — (p w + s)) to A. 
for v = 1 to n: 

Let S v = [ (w, s ) | ( v , w, s) £ A ]. 

for each ordered d-tuple ((vi,wi, s i), . . . , (vd, mj, Sd)) from A: 

Let v = Vi. 

Let B be the d x d matrix with columns si, ... ,Sd- 
if B is singular: 

Skip to the next loop iteration. 

Set up an empty FIFO queue Q and enter v. 

Set I v = 1, q v = 0 and k = 2. 

Set N=[ ]. 

while Q is not empty: 

Pop v from Q. 

Set T = [ ( w , I3 _1 s) | (tu, s) £ S v ] and sort by second components. 
for (w, s) in T: 

if I w is undefined: 

Set I w = k, q w = q v + s and increase k. 

Enter w into Q. 

Set t — Qv — h S Qw • 

if I v < Iw or (I v = Iw and t > 0): 

Append (I v ,I w ,t) to N. 

if C is undefined or N < C: 

Set C = N. 

Let B be a deterministic basis for the integral span of [ s \ (v,w,s) £ C ]. 
for (v, w, s) in C: 

Set t = B~ 1 s. 
ifv = w and t < 0: 

Set t = —t. 

Replace (v, w, s) by (u, w, t) in C. 

Sort and return C. 
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triangulating a matrix over the integers. Bachem and Kannan [BK79] were the 
first to present an polynomial time algorithm for this latter problem. 

We conclude that, even after taking into account the extra cost involved in 
using precise rational number arithmetic, canonical forms for stable, connected, 
periodic graphs can be computed in polynomial time (assuming the dimension 
is taken to be constant) using barycentric vertex placement, and that two such 
graphs, given by their canonical forms, can be tested for isomorphism in time 
0(md). 

6 Applications 

The methods discussed here have been implemented in the programming lan- 
guage Python. The program, called Systre , is available from the author upon 
request. It contains a library for dealing with crystallographic groups in Dimen- 
sion 3, following the conventions of the so-called International Tables [Hah83]. 
Because crystal structures often display fewer translational symmetries than pos- 
sible, an algorithm for finding additional potential translations has been incorpo- 
rated as well. These can be recognized immediately as translational symmetries 
of the barycentric drawing. Systre computes both the ideal symmetry group and 




Fig. 4. A periodic tiling of 3-dimensional space with some non-planar faces. 



canonical form of a given periodic graph. The latter can be used as a database 
index. Graphs are specified by vector representations or, alternatively, by atom 
positions and unit cell parameters. Systre then tries to deduce the bonds. A 
relaxed drawing, aiming for uniform edge lengths and a large total volume can 
be produced, using the barycentric drawing as a starting point. 

In Section 4 we discussed barycentric placements for periodic, planar graphs 
and thus periodic tilings of the plane. Tilings of 3-dimensional space can be 
drawn similarly: first the 1-skeleton is embedded, using barycentric placement 
and force- models. Non-planar faces are then realized by first triangulating them, 
then barycentrically placing the inner vertices. This leads to nice, smooth-looking 
curved surfaces with the full symmetry of their respective boundaries. Figure 4 
shows an example. 
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Abstract. A three-dimensional grid drawing of a graph is a placement 
of the vertices at distinct points with integer coordinates, such that the 
straight line-segments representing the edges are pairwise non-crossing. 
A 0(n 3 / 2 ) volume bound is proved for three-dimensional grid drawings 
of graphs with bounded degree, graphs with bounded genus, and graphs 
with no bounded complete graph as a minor. The previous best bound 
for these graph families was <D(n 2 ). These results (partially) solve open 
problems due to Pach, Thiele, and Toth [Graph Drawing 1997] and Fel- 
sner, Liotta, and Wismath [Graph Drawing 2001]. 



1 Introduction 

A three-dimensional straight-line grid drawing of a graph, henceforth called a 
3D drawing , is a placement of the vertices at distinct points in Z 3 (called grid- 
points), such that the line-segments representing the edges are pairwise non- 
crossing. That is, distinct edges only intersect at common endpoints, and each 
edge only intersects a vertex that is an endpoint of that edge. In contrast to 
the case in the plane, it is well known that every graph has a 3D drawing. We 
are therefore interested in optimising certain measures of the aesthetic quality 
of such drawings. 

The bounding box of a 3D drawing is the minimum axis-aligned box contain- 
ing the drawing. If the bounding box has side lengths X — 1, Y — 1 and Z — 1, 
then we speak of an XxYx Z drawing with volume X-Y-Z. That is, the volume 
of a 3D drawing is the number of gridpoints in the bounding box. This definition 
is formulated so that 2D drawings have positive volume. We are interested in 3D 
drawings with small volume, which is a widely studied problem [3,4,5,6,7,8,10, 
11,12,14,18,24,25,27]. Three-dimensional graph drawings in which the vertices 
are allowed real coordinates have also been studied (see the references in [11]). 
The authors have also established bounds on the volume of three-dimensional 
polyline grid drawings, where bends in the edges are also at gridpoints [11]. Ta- 
ble 1 summarises the best known upper bounds on the volume of 3D drawings, 
including those established in this paper. 

* Research supported by NSERC and FCAR. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 190-201, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 



Three-Dimensional Grid Drawings 



191 



Table 1. Upper bounds on the volume of 3D drawings of n-vertex m-edge graphs. 



graph family 


volume 


reference 


arbitrary 


0(n a ) 


Cohen et al. [5] 


arbitrary 


0(m 4 / 3 n) 


Theorem 4 


maximum degree A 


O(Amn) 


Theorem 3 


constant maximum degree 


0(n 3/2 ) 


Theorem 10 


constant chromatic number 


0(n 2 ) 


Pach et al. [24] 


constant chromatic number 


0(m 2/3 n ) 


Theorem 6 


no A^-minor ( h constant) 


0(n 3/2 ) 


Theorem 9 


constant genus 


0(n 3/2 ) 


Theorem 8 


constant tree-width 


0(n) 


Dujmovic and Wood [12] 



Cohen et al. [5] proved that every graph has a 3D drawing with 0(n 3 ) volume, 
and that this bound is asymptotically optimal for complete graphs K n . Our edge- 
sensitive bounds of 0(m 4 ^ 3 n) and O(Amn) are greater than 0(n 3 ) in the worst 
case. It is an open problem whether there are edge-sensitive bounds that match 
the 0(n 3 ) bound in the case of complete graphs. 

Pach et al. [24] proved that graphs with constant chromatic number have 
3D drawings with 0(n 2 ) volume. For c-colourable graphs the actual bound is 
0(c 2 n 2 ). Our edge-sensitive bound of 0(m 2 ^ 3 n) is an improvement on this result 
for graphs with constant chromatic number and o(n 3,/2 ) edges. Pach et al. [24] 
also proved an I7(n 2 ) lower bound for the volume of 3D drawings of com- 
plete bipartite graphs K n n . This lower bound was generalised to all graphs by 
Bose et al. [3], who proved that every 3D drawing has volume at least | (n + m). 

Graphs with constant maximum degree have constant chromatic number, 
and by the result of Pach et al. [24], have 3D drawings with 0(n 2 ) volume. 
Pach et al. [24] conjectured that graphs with constant maximum degree have 
3D drawings with o(n 2 ) volume. We verify this conjecture by proving that such 
graphs have 3D drawings with 0[n 3 ^ 2 ) volume. 

The first 0(n) upper bound on the volume of 3D drawings was established 
by Felsner et al. [14] for outerplanar graphs. This result was generalised by the 
authors for graphs with constant tree- width [12]. Improved constants have been 
obtained in a number of special cases [6,7,8,18]. Felsner et al. [14] proposed the 
following inviting open problem: does every planar graph have a 3D drawing 
with 0(n) volume? In this paper we provide a partial solution to this question, 
by proving that planar graphs have 3D drawings with 0(n 3 / 2 ) volume. Note 
that 0(?r 2 ) is the optimal area for plane 2D grid drawings, and 0(n 2 ) was the 
previous best upper bound on the volume of 3D drawings of planar graphs. 

A graph H is a minor of a graph G if H is isomorphic to a graph obtained 
from a subgraph of G by contracting edges. The genus of a graph G is the min- 
imum 7 such that G can be embedded in the orientable surface with 7 handles. 
Of course, planar graphs have genus 0 and no A' 5 -minor. A generalisation of 
our result for planar graphs is that every graph with constant genus or with no 
Ah, -minor for constant h has a 3D drawing with 0{n 3 / 2 ) volume. 
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2 Track Layouts 

We consider undirected, finite, and simple graphs G with vertex set V ( G ) and 
edge set E{G). The number of vertices and edges of G are respectively denoted 
by n = |T(G)| and m = |1£(G)|. A vertex c-colouring of G is a partition {Vi : 
1 < i < c} of V(G), such that for every edge vw £ E(G ), if v £ Vi and w £ Vj 
then i ^ j. Each i £ {1,2, . . . , c} is a colour, and each set Vi is a colour class. 
At times it will be convenient to write col(u) = i rather than v £ V t . If G has a 
vertex c-colouring then G is c- colourable. The chromatic number of G, denoted 
by x(G), is the minimum c such that G is c-colourable. 

Let {Vi : 1 < i < c} be a vertex c-colouring of a graph G. Let <, be 
a total order on each colour class Vj. Then each pair (V,<i) is a track, and 
{(Ti,<i) : 1 < i < t} is a t-track assignment of G. To ease the notation we 
denote track assignments by { V : 1 < i < c} when the ordering on each colour 
class is implicit. An X- crossing in a track assignment consists of two edges vw 
and xy such that v <i x and y <j w, for distinct colours i and j. A t-track layout 
of G consists of a t-track assignment of G with no X-crossing. The track-number 
of G, denoted by tn(G), is the minimum t such that G has a t- track layout 1 

Track layouts and track-number were introduced in [10,12] although they 
are implicit in many previous works [14,19]. Track layouts and 3D drawings are 
closely related, as illustrated by the following result by Dujmovic et al. [10]. Also 
note that other authors have used track layouts to produce 3D drawings [6,8, 
18], and there is a tight relationship between track layouts and another type 
of graph layout called a queue layout [27], which is a dual structure to a book 
embedding introduced by Heath et al. [19]. 

Theorem 1 . [ 10 ] Every n-vertex graph G with track-number tn(G) < t has a 
2t x 4t x 4f[j] drawing, which has 0(t 2 n ) volume. Conversely, if a graph G has 
an X x Y x Z drawing then G has track-number tn(G) < 2 XY. 

We have the following upper bounds on the track-number. 

Lemma 1 . Let G be a graph with n vertices, maximum degree A, path-width 
p, tree-width w, genus 7, and with no Kh-minor. Then the track-number of 
G satisfies: (a) tn(G) < p + 1, (b) tn(G) < 0(6 4 “), (c) tn(G) < 72 wA, 

(d)tn (GJeOt^n 1 / 2 ), (e) tn (G) e Of^n 1 / 2 ) . 

Proof. Part (a) is by Dujmovic et al. [10]. Parts (b) and (c) are by the au- 
thors [12]. Gilbert et al. [16] and Djidjev [9] independently proved that G has 
a G(7 1 / 2 n 1 / 2 )-separator, and thus has G(7 1 ^ 2 n 1//2 ) path-width (see Bodlaen- 
der [2, Theorem 20(iii)]). Hence (d) follows from (a). Similarly (e) follows from 
the result by Alon et al. [1] that G has a 0(/i 3 / 2 n 1//2 )- separator. □ 

The next result is the fundamental contribution of this section. 

Theorem 2. Every graph G with m edges and maximum degree A has track- 
number tn(G) < 1A'/Am. 



1 This definition of track-number is unrelated to that of Gyarfas and West [17]. 
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To prove Theorem 2 we introduce the following concept. A vertex colouring 
is a strong star colouring if between every pair of colour classes, all edges (if any) 
are incident to a single vertex. That is, each bichromatic subgraph consists of a 
star and possibly some isolated vertices. The strong star chromatic number of a 
graph G, denoted by Xsst (G), is the minimum number of colours in a strong star 
colouring of G. Note that star colourings , in which each bichromatic subgraph is 
a forest of stars, have also been studied (see [23] for example). The star chromatic 
number of a graph G, denoted by Xst (G) , is the minimum number of colours in 
a star-colouring of G. With an arbitrary order on each colour class in a strong 
star colouring, there is no X-crossing. Thus track-number tn(G) < Xsst(G) for 
every graph G, and Theorem 2 is an immediate corollary of the next lemma. 

Lemma 2. Every graph G with m edges and maximum degree A > 1 has strong 
star chromatic number Xsst(G) < 14V Am. 

The proof of Lemma 2 uses the weighted version of the Lovasz Local Lemma 

[13]- 

Lemma 3. [22, p. 221] Let £ = {Ai, . . . , A n } be a set of ‘bad’ events. Let 
0 < p < | be a real number, and let t\, . . . , t n > 1 be integers. Suppose that for 
all Ai £ £ , 

(a) the probability P (Af) < p**, 

(b) Aj, is mutually independent of £ \ ({A^} U Df) for some Di C £, and 

(c) y, ^ ^ | ■ 

AjGDi 

Then with positive probability, no event in £ occurs. 

Proof (of Lemma 2). Let c > 4 be a positive integer to be specified later. Let 
p = 1. Then 0 < p < {. For each vertex v £ V[G), randomly and independently 
choose col (v) from {1,2,..., c}. For each edge mu £ E(G), let A vw be the type-I 
event that col(v) = col(ui). Let E' be the set of arcs E' = {(u,w), (w,v) : vw £ 
E(G)}. For each pair of arcs [v, w), ( x , y) £ E' with no endpoint in common, let 
B{v, w ),{x,y) be the type-II event that col(u) = col (a;) and col(ru) = col(j/). 

We will apply Lemma 3 to obtain a colouring such that no type-I event and 
no type-II event occurs. No type-I event implies that we have a (proper) vertex 
colouring. No type-II event implies that no two disjoint edges share the same 
pair of colours; that is, we have a strong star colouring. For each type-I event A, 
P(A) = 1. Let tA = 1. Then P(A) = p tA . For each type-II event B, P (B) = 4j-. 
Let ts = 2. Then P (B) = p tB . Thus condition (a) of Lemma 3 is satisfied. 

An event involving a particular set of vertices is dependent only on other 
events involving at least one of the vertices in that set. Each vertex is involved 
in at most A type-I events, and at most 2Z\|i? , | = 4Z\m type-II events. A type-I 
event involves two vertices, and is thus mutually independent of all but at most 
2Z\ type-I events and at most 8 Am type-II events. A type-II event involves four 
vertices, and is thus mutually independent of all but at most 4 A type-I events 
and at most 16Z\m type-II events. 
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For condition (c) of Lemma 3 to hold we need 2 A ( 2 ) 1 +8 Am ( 2 ) 2 < -[for the 
type-I events, and 4 A + 16Z\m ( 2 ) 2 < 1 for the type-II events. It is a happy 
coincidence that these two equations are equivalent, and it is easily verified that 
c = [4(A + y/Z\(l + 4m) )] > 4 is a solution. Thus by Lemma 3, with positive 
probability no type-I event and no type-II event occurs. Thus for every vertex 
v € V (G), there exists col (i>) € {1, . . . , c} such that no type-I event and no type- 
II event occurs. As proved above such a colouring is a strong star colouring. 
Since A < \J Am, the number of colours c < [4(1 + y/5)\/ Arn\ < 14 '/Am. □ 

Theorems 1 and 2 imply: 

Theorem 3. Every graph with n vertices, m edges and maximum degree A has 
a O^Am) 1 / 2 ) x ©((Am) 1 / 2 ) x 0(n) drawing with O(Amn) volume. □ 

We have the following corollary of Lemma 2. 

Lemma 4. Every graph G with m edges has strong star chromatic number 
Xsst(G) < 15m 2 / 3 . 

Proof. Let X be the set of vertices of G with degree greater than ^m 1 / 3 . Let 
H be the subgraph of G induced by 14(G) \ X. Thus H has maximum de- 
gree at most jm 1 / 3 . By Lemma 2, H has a strong star colouring with at most 
14(|m 1 / 3 m) 1 / 2 = 7m 2 / 3 colours. Now \X\ < 2 m/(|m 1 / 3 ) = 8m 2 / 3 . By adding 
each vertex in X to its own colour class we obtain a strong star colouring of G 
with at most 15m 2 / 3 colours. □ 

Since tn(G) < x sst (G), Lemma 4 implies that tn(G) < 15m 2 / 3 , and by 
Theorem 1 we have: 

Theorem 4. Every graph with n vertices and m edges has a G(m 2 / 3 ) x 
G(m 2 / 3 ) x 0(n) drawing with G(m 4 / 3 u) volume. □ 

3 Sub-quadratic Volume Bounds 

Vertex colourings [24] and track layouts [10] have previously been used to pro- 
duce 3D drawings with small volume. In the following sequence of results we 
combine vertex colourings and track layouts to reduce the quadratic dependence 
on t in Theorem 1 to linear. This comes at the expense of a higher dependence 
on the chromatic number. However, in the intended applications the chromatic 
number will be constant, or at least will be independent of the size of the graph. 
The proof of the next lemma is a further generalisation of the ‘moment curve’ 
method for producing three-dimensional graph drawings [5,24,10]. 

Lemma 5. Let G be a graph with a vertex c-colouring (V) : 0 < i < c — 1}, and 
a track layout {Tij :0<*<c — 1, 1 < j < ti}, such that each Tjj C V). Then G 
has a O(c) x 0(c 2 t) x 0(c 5 tn') drawing, where t = rn.ax.ti and n! = ma.x|Tjj|. 
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Proof. Let p be the minimum prime such that p > c. Then p < 2c by Bertrand’s 
postulate. Let v(i,j,k) denote the k th vertex in track T i r Define 

Y(i,j) = p(2it + j) + ( i 2 mod p), and 

Z(i,j , k) = p(20cin' ■ Y{i,j) + k) + (i 3 mod p) . 



Position each vertex v(i,j,k) at the gridpoint (i,Y(i,j),Z(i,j,k)), and draw 
each edge as a line-segment between its endpoints. Since Y(i,j) € 0{c 2 t) and 
Z(i,j, k) e 0(c 3 n' ■ Y(i,j)), the drawing is O(c) x 0(c 2 t) x 0(c 5 tn'). 

Observe that the tracks from a single colour class are within a distinct Y Z- 
plane, each track occupies a distinct vertical line, and the ^-coordinates of the 
vertices within a track preserve the given ordering of that track. In addition, the 
F-coordinates satisfy the following property. 

Claim 1. For all distinct colours i\ and i 2 and for all 1 < ji, j 2 < t, we have 
2c | (y(*i,ji) — Y{i 2 ,j 2 ) | is greater than the Y -coordinate of any vertex. 



Proof. Without loss of generality i\ > i 2 . Observe that every F-coordinate is less 
thanp(2(c— l)t+t)+p = p(2ct—t+l) < 2cpt. Now 2c| Y(i x ,ji) — Y(i 2 , j 2 ) \ > 
2c | p(2 i\ t + 1) — p(2 i 2 t + t + 1) | > 2cp \ 2{i 2 + l)t — (2i 2 t + t) \ = 2 cpt. □ 

We first prove that the only vertices each edge intersects are its own end- 
points. It suffices to prove that if three tracks are collinear in the XF-plane 
then they are all from the same colour class. Loosely speaking, an edge does not 
pass through any track. Clearly two tracks from the same colour class are not 
collinear (in the XF-plane) with a third track from a distinct colour class. Thus 
we need only consider tracks {T(i a ,j a ) : 1 < i a < 3} from three distinct colour 
classes {ii,i 2 ,i 3 }. Let R be the determinant, 



R = 



1 

1 

1 



H 

*2 

h 



Y(ii,j i) 

Y {i 2 , jf) 

Y{h,h) 



If the tracks {T(i a ,j a ) : 1 < i a < 3} are collinear in the XF-plane then R = 0. 
However Y(i,j) = i 2 (mod p), and thus 



R = 



H 

ii 

h 



n 

l<a</3<3 



— ip) ^0 (mod p) 



since i a ^ ip, and p is a prime greater than any i a — ip. Thus R ^ 0, and the 
tracks {T(i a , j a ) : 1 < i a < 3} are not collinear in the XF-plane. Hence the 
only vertices that an edge intersects are its own endpoints. 

It remains to prove that there are no edge crossings. Consider two edges e 
and e! with distinct endpoints v(i a , j a , k a ), 1 < a < 4. (Clearly edges with a 
common endpoint do not cross.) Let Y a = Y(i a ,j a ). Consider the determinant 



1 i x Fi Z(i 1 ,j 1 ,k 1 ) 
1 * 2 F 2 Z(i 2 ,j 2 , k 2 ) 
1 * 3 F 3 Z(i 3 ,j 3 ,k 3 ) 
1 u F 4 Z(u,ji,ki) 



D = 



196 V. Dujmovic and D.R. Wood 



If e and e' cross then their endpoints are coplanar, and D = 0 . Thus it suffices to 
prove that D ^ 0 . We proceed by considering the number N = |{*i , i 2 , *3, *4}! of 
distinct colours assigned to the four endpoints of e and e' . Clearly N £ { 2 , 3 , 4 }. 

,k a ) = i 3 a (mod p), 



D = 



Since Y a 


= i 2 


1 ii 


9 2 

l l 


,•3 

*1 


1 *2 


9 2 


i 3 

*2 


1 *3 


9* 2 


7 3 

*3 


1 i 4 


9* 2 

Z 4 


7 3 

*4 



(mod p ) and Z(i a ,j a 



(*a ip) ^ 0 
l<a</ 3<4 



(mod p ) 



since i a ^ ip, and p is a prime greater than any i a — ip. Thus D/ 0 . 

Case N = 3 : Without loss of generality i 4 = i 2 . It follows that D = 5 Sq + 
S 4 T $2 T 5*3 T 5*4 where 



5 0 = Acpn'(i 3 - ii)(i 4 - *i)(^2 - Y 4 ){Y 3 - *4) 

51 = p(Y 2 - W) (M *4 - *1) - k 4 (i 3 - *i)) 

5 2 = p{u~ h){k 2 Y 4 - k\Y 2 ) 

5 3 = p{k 2 - fci)(y 4 (*3 - *1) - y 3{u - *1)) 

5 4 = (Y 2 - Yi)( (i 3 - * 4 )(*? mod p)-(i 3 - ii)(i 4 mod p) + (i 4 ~ *'i)(*l mod p ) ) . 

If Yi = Y 2 then e and e' do not cross, since no three tracks from distinct 
colour classes are collinear in the XY-plane. Assume Y\ ^ Y 2 . If i 3 < i\ < i 4 or 
i 4 < i\ < i 3 then e and e' do not cross, simply by considering the projection in 
the XF-plane. Thus i\ < i 3 ,i 4 or i 4 > i 3 ,i 4 , which implies 



(*4 - *i )(*3 - *1) > |*4 - *3 1 • 



(1) 



Claim 2 . If |S Q | > |Si|, \S 0 \ > \S 2 \, |S 0 | > |S 3 | and |S 0 | > |S 4 | then D^O. 

Proof. To prove that D = 5S0+S1+S2+S3+S4 is nonzero it suffices to show that 
D' = ±5|S , o|±|S , i|±| 5 , 2|±|5 , 3|±|5'4| is nonzero for all combinations of pluses and 
minuses. Consider X = ±|Si| ± |S 2 | ± IS3I ± |S 4 | for some combination of pluses 
and minuses. Since |Si| < |So|, |S 2 | < |So|, IS3I < |So|, an d |S 4 | < |So|, we have 
— 4 |S 0 | < X < 4 |S 0 |. Since S 0 0 , we have 5 |S 0 | + X ^ 0 and — 5 |S 0 | + 1 / 0 . 
That is, all values of D' are nonzero. Therefore D/ 0 . □ 

Therefore, to prove that D / 0 it suffices to show that |S 0 | > |Si|, |So| > 
| S 2 1 , jS'ol > 1 53 1 and |So| > IS// We will use the following elementary facts 
regarding absolute values: 

Vai, . . . ,a k £ K |aia 2 . . .a k \ = |ai||a 2 | ■ • • |a fc |, and 

|«i + c*2 H 1-Ofcl < |ai| + |a 2 | H h|a fe | < k ■ max{|oi|, |o 2 |, • . • , |ofc|} . 

• First we prove that |So| > |Si|. That is, 

|4cpn'(f 3 - *i)(*4 - ii){Y 2 - Yf){Y 3 — Y 4 )\ > \p(Y 2 - Y 4 )(k 3 (i 4 - if) - k 4 (i 3 — *i))| • 



Three-Dimensional Grid Drawings 



197 



Hence, |5o| > |Si| is implied if 

2n'\i 3 - «i ||* 4 - *i | |*3 - Y 4 \ > | fc 3 (* 4 - h) - k 4 (i 3 - ii)| . 

4= 2n '|* 3 - *i ||* 4 - *i||Y 3 - Y 4 | > 2 • max{|fc 4 (* 3 - *i)|, |fc 3 (* 4 - *i)|} . 

Since n' > fc 3 , k 4 and |Y 3 — Y±\ > 1, 

| -So I > |Si| 4= 1*3 - *i ||* 4 -ill > max {|* 3 - i 4 \, \u ~h\} ■ 

Thus |So| > |5i| since |* 3 — i\\ > 1 and |z 4 — *i| > 1. 

• Now we prove that |So| > l-S^I- That is, 

\4:Cpn'(i 3 -i 1 )(n-i 1 )(Y 2 -Y 1 )(Y 3 -Y4)\ > \p(i 4 - i 3 )(k 2 Y 1 - AtY 2 )| . 

By (1) and since |Y 2 — Yl| > 1, |5o| > |£ 2 | is implied if 1 4cn'(Y 3 — Y 4 ) | > 
\k 2 Y 1 — This in turn is implied if |2cn'(Y 3 — 14 ) | > max{|fc 2 Yi|, | fci Y 2 1 }, 

which holds since n’ > k 4l k 2 and |2c(Y 3 — Y 4 )| > max{Yi, Y 2 } by Claim 1. 

• Now we prove that |5 Q | > |5 3 |. That is, 

|4cpn '(« 3 - *i )(*4 - *i)(Y 2 - Yi)(Y 3 — Y 4 )| > |p(fc 2 - fci)( Y 4 (*3 - *i) - Y 3 (u - ii))| • 

Now n! > \k 2 — fci| and |Y 2 — Yi| > 1. Thus |5o| > |S 3 | is implied if 

| 4 c(t 3 -ti)(t 4 -ti)(y 8 -n)| > |n(t 3 -*i) - y 3 (* 4 -*i)| . 

4= | 2 c (*3 - *i )(*4 - *i)(Y 3 - Y 4 )| > max{|y 4 (i 3 -ii)| , |y 3 (i 4 - ii) |} , 

which holds since |2c(Y 3 — ^ 4 ) | > max{Yi, Y 2 } by Claim 1. 

• Finally we prove that |So| > |S 4 |. That is, 

|4cpn'(i3 - ti)(*4 - ii)(y a - Fi)(y 3 - n)| > 

\(Y 2 - Yi)( (* 3 - * 4 )(*1 mod p) - (*3 - *i )(*4 mod p) + (i 4 - ii )(*3 mod p) )| . 

Now cn’\Y 3 — Y 4 1 > 1. Thus |So| > |S 4 | is implied if 
|3p(i 3 - ii)(*4 - *i)| > 

I (*3 - *4)(*i mod p) - (i 3 - ii)(*4 mod p) + (u - ii )(*3 modp) | 

<= |3p(*3 - il)(i 4 - *l)| > 

3 • max {| (i 3 - i 4 )(*i modp)| , |(i 3 - ii)(i 4 modp)| , | (24 - *i)(i| modp) |} 

4= |(*3 - *i)(i 4 - *i)| > max {| i 3 — i 4 | , |*3 — ii| , |i 4 — * 1 1 } , 

which holds by ( 1 ). 

Case N = 2: Without loss of generality *i = i 2 7 ^ * 3 = * 4 . If Yi = Y 2 and 
Y 3 = Y 4 then e and e' do not cross as otherwise there would be an X-crossing 
in the track layout. If Y\ = Y 2 and Y 3 ^ Y 4 (or Y\ ^ Y 2 and Y 3 = Y 4 ) then e 
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and e! do not cross, by considering the projection in the XT-plane. Thus we can 
assume that Lj ^ Y 2 and Y 3 ^ Y 4 . It follows that 

D = p(i 4 -i 3 ) ( 5 • Acn'(Y 2 - Yi)(F 4 - F 3 )(*3 - *i) 

+ (h-k 2 )(Y 4 -Y 3 ) + {k 4 -k 3 ){Y 2 -Y 1 )) . 

As in Claim 2, to show that 0 it suffices to show that 

|4cn , (y 2 -yi)(K 4 -i r 3)(t3-*i)| > \{h-k 2 )(Y 4 -Y 3 )\ , (2) 

and |4cn'(y 2 - yi)(y 4 -y3)(*3 - *i) | > I (&4 - k 3 )(Y 2 - Y 4 ) \ . (3) 

Inequalities (2) and (3) hold since n' > \k 4 — k 2 \ and n! > \k 4 — k 3 \. □ 

Note that the constant 20 in the definition of Z(i,j,k) in the proof of 
Lemma 5 is chosen to enable a relatively simple proof. It is easily seen that it can 
be reduced. The proof of the next lemma is based on an idea of Pach et al. [24] 
for balancing the size of the colour classes in a vertex colouring. 

Lemma 6. Let G be an n-vertex graph with a c-colouring {Vj : 0 < * < c — 1} 
and a track layout {T-ij : 0 < i < c — 1,1 < j < ti\, such that each Tjj C Vj. Let 
k = tj be the total number of tracks. Then G has a 0(c) x O(ck) x 0(c 4 n ) 
drawing. 

Proof. Replace each track by tracks of size exactly [?], except for at most one 
track of size at most [?]. Order the vertices within each track according to the 
original track, and consider the new tracks to belong to the same colour class as 
the original. Clearly no X-crossing is created. Within Vi there are now at most 
ti + IWm tracks. The total number of tracks is + |Vi|/rfD — 2/c. For 

each colour class V], partition the set of tracks in Vi into sets of size exactly 
I"—], except for one set of size at most [ ^ ] . Consider each set to correspond 
to a colour. The number of colours is now at most c + 2k/\^f] < 2c. Applying 
Lemma 5 with 2c colours, n' = , and t = \^f~\ , we obtain the desired drawing. 

□ 



Theorem 5. Every c-colourable graph G with n vertices and track-number 
tn(G) < t has a O(c) x 0(c 2 t) x 0(c 4 n) drawing with 0(c 7 tn) volume. 

Proof. Let {Vi :0<*<c — l}bea c-colouring of G. Let {Tj : 1 < j < t} be a 
f-track layout of G. For all 0 < i < c — 1 and 1 < j <t, let T{ 4 = Vi fl Tj. Then 
{V. : 0 < i < c — 1} and {Tjj :0<*<c — 1, 1 < j < t} satisfy Lemma 6 with 
k = ct. Thus G has the desired drawing. □ 

In the case of bipartite graphs we have a simple proof of Theorem 5 with 
improved constants. 

Lemma 7. Every n-vertex bipartite graph G with track-number tn (G) < t has 
a 2 x t x n drawing. 
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Proof. Let {A, B} be the bipartition of V(G). Let {Ti : 1 < * < t} be a t-track 
layout of G. For each 1 < i < t, let = Ti fl A and B i = T i P\ B. Order each Ai 
and Bi as in Ti. Place the j th vertex in A { at (0, i, j + I A k \). Place the j th 

vertex in Bi at (1 ,t — i + 1 , j + Y^k=i \^k\)- The drawing is thus 2 x t x n. Let 
AiBj be the set of edges with one endpoint in Ai and the other in Bj. There 
is no crossing between edges in AiBj and AiBj as otherwise there would be 
an X-crossing in the track layout. Clearly there is no crossing between edges in 
AiBj and AiB k for j ^ k. Suppose there is a crossing between edges in AiBj 
and AkBg with i ^ k and j ^ l. Without loss of generality i < k. Then the 
projections of the edges in the XT-plane also cross, and thus j < i. Hence the 
projections in the XZ - plane do not cross, and thus the edges do not cross. □ 

Lemma 4 with tn(G) < Xsst(G) and Theorem 5 imply: 

Theorem 6. Every c-colourable graph with n vertices and m edges has a 0(c) x 
0(c 2 m 2 / 3 ) x 0(c 4 n) drawing with 0(c e m 2 ^ 3 n) volume. □ 

The next result is one of the main contributions of this paper. 

Theorem 7. Every planar graph with n vertices has a 0(1) x 0(n x ^ 2 ) x 0(n) 
drawing with 0(n 3 / 2 ) volume. 

Proof. Planar graphs have (^(n 1 / 2 ) path-width (see [2]), and thus have Ofn 1 / 2 ) 
track- number by Lemma 1(a). The result follows from Theorem 5 since planar 
graphs are 4-colourable. □ 

The following generalisation of Theorem 7 follows from Lemma 1(d), Theo- 
rem 5, and the classical result of Heawood [20] that x(G) S O (t 1 ^ 2 )- 

Theorem 8. Every n-vertex graph with genus 7 has a 0( 7 1//2 ) x G(7 3 / 2 n 1 / 2 ) x 
0(^ 2 n) drawing with G(7 4 n 3 / 2 ) volume. □ 

The next generalisation of Theorem 7 for graphs with no LG-minor fol- 
lows from Lemma 1(e), Theorem 5, and the result independently due to Kos- 
tochka [21] and Thomason [26] that y(G) 6 O^hlog 1 ^ 2 h). 

Theorem 9. Every n-vertex graph with no Kh-minor has a 0(h\og 1 ^ 2 h) x 
G(/) 7 / 2 log/i-n 1 / 2 ) x 0(h 4 log 2 h-n ) drawing with volume 0(h 17 / 2 log 7 / 2 h-n 3 / 2 ). 

Finally we consider the maximum degree A as a parameter. By the sequential 
greedy algorithm, G is (A + l)-colourable. Thus by Theorems 2 and 5 we have: 

Theorem 10. Every graph with n vertices, m edges, and maximum degree A 
has a O(A) x 0(A 3 l 2 mfl 2 ) x 0(A A n) drawing with 0(A 15 ^ 2 m 1 ^ 2 n) volume. □ 

Graphs with constant A have 0(n ) edges. By Theorems 8, 9, and 10 we have: 

Corollary 1. Every n-vertex graph with constant genus, or with no Kh-minor 
for some constant h, or with constant maximum degree has a 0(1) x G(n 4 / 2 ) x 
O(n) drawing with 0(n 3 ^ 2 ) volume. □ 
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We conclude with the following open problems: Does every graph have a 
3D drawing with 0(nm) volume? Does every graph with constant chromatic 
number have a 3D drawing with 0(ny/m) volume? These bounds match the 
lower bounds for K n and K n ^ n , and would make edge-sensitive improvements to 
the existing upper bounds of 0(n 3 ) and 0(n 2 ), respectively. These edge-sensitive 
bounds would be implied by Theorems 1 and 5 should every graph have 0(y/m) 
track-number. In turn, this bound on track-number would be implied should 
every graph have 0(y/rn) strong star chromatic number. As far as the authors 
are aware, a 0(y/m) bound is not even known for star chromatic number. The 
best known bound in this direction is Xst(G) < 11m 3 / 5 , which can be proved in 
a similar fashion to Lemma 4, in conjunction with the result of Fertin et al. [15] 
that Xst(G) < [20Z\ 3 / 2 ] (see [11]). 
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Abstract. In this paper, we study a layout problem of a digraph us- 
ing queues. The queuenumber of a digraph is the minimum number of 
queues required for a queue layout of the digraph. We present upper and 
lower bounds on the queuenumber of an iterated line digraph L k (G ) of 
a digraph G. In particular, our upper bound depends only on G and 
is independent of the number of iterations k. Queue layouts can be ap- 
plied to three-dimensional drawings. From the result on the queuenum- 
ber of L k (G), it is shown that for any fixed digraph G, L k (G) has a 
three-dimensional drawing with 0(n) volume, where n is the number 
of vertices in L k (G). We also apply these results to particular families 
of iterated line digraphs such as de Bruijn digraphs, Kautz digraphs, 
butterfly digraphs, and wrapped butterfly digraphs. 



1 Introduction 

Let H be a graph. The vertex set and the edge set of H are denoted by V (H) 
and E(H) : respectively. A k-queue layout of H consists of a linear ordering a 
of the vertices, i.e. , a is a bijection from V(H) to {1,2,... ,\V(H)\}, and an 
assignment of each edge to one of k queues so that the set of edges assigned 
to each queue obeys a first-in/first-out discipline. (Consider scanning in left-to- 
right (ascending) order. If the left vertex of an edge e is encountered, e enters its 
assigned queue. If the right vertex of e is encountered, e exits from its assigned 
queue.) We can formally define such a discipline as follows; if edges {a, &}, {c, d} 
are in the same queue such that er(a) < cr(&), tr(c) < <r(d), and cr(a) < <r(c), then 
one of the following inequalities holds: 

— cr (a) < a(b) < cr(c) < a(d) 

— a (a) < <r(c) < a(b) < a(d) 

The minimum number of queues required for a queue layout of H is the 
queuenumber of H , and denoted by qn(U). 

As a dual concept of a queue layout, a stack layout is known. A k-stack layout 
of H consists of a linear ordering a of the vertices, and an assignment of each 
edge to one of k stacks so that the set of edges assigned to each stack obeys a 
last-in/first-out discipline. (If the left vertex of an edge e is encountered, e is 
pushed into its assigned stack, and if the right vertex of e is encountered, e is 
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popped out of its assigned stack.) That is, if {a, b}, {c, d} are in the same stack 
such that er(a ) < cr(b), cr(c) < cr(d), and a(a) < a(c ), then one of the following 
inequalities holds: 

— a (a) < a(b) < cr(c) < a(d) 

— a(a) < a(c) < a(d) < a(b) 

The minimum number of stacks required for a stack layout of H is the stack- 
number of H , and denoted by sn (H). (A stack layout is equivalent to a book- 
embedding. In the study of book-embeddings, pagenumber is used instead of 
stacknumber.) Queue and stack layouts of a digraph are similarly defined as 
those of the underlying graph of the digraph. (Note that our definitions for di- 
graphs are different from queue and stack layouts defined in [19,20] in which all 
arcs must have the same direction with respect to the vertex ordering.) 

Queue and stack layouts are motivated by several area of computer science 
[5,9,18,21,25,27,29,30,31]. In particular, such layouts of interconnection networks 
have applications to the DIOGENES approach, proposed by Rosenberg [27], to 
fault-tolerant processors array. Also, Wood [30,31] have shown that queue and 
stack layouts can be applied to three-dimensional drawings. For queue layouts, 
he proved that every graph G with n vertices from a proper minor-closed family 
has an 0(1) x 0(1) x 0{n ) three-dimensional straight-line grid drawing if and 
only if G has 0(1) queuenumber. 

Until now, queue and/or stack layouts have been studied for many graph 
classes: 

— Stacknumber: complete graphs [4], complete bipartite graphs [24], butterfly 
graphs [12], trees, grids, X-trees [5], hypercubes [5,22], de Bruijn digraphs, 
Kautz digraphs, shuffle-exchange graphs [16], planar graphs [32], genus-g 
graphs [23], bandwidth-fc graphs [28], fc-trees [13], iterated line digraphs [14]. 

— Queuenumber: complete graphs, complete bipartite graphs, trees, grids, uni- 
cyclic graphs, X-trees, binary de Bruijn graphs, butterfly graphs (all in [21]), 
fc-tree [6,26,31], 

In this paper, we treat iterated line digraphs and study queue layouts for 
the class. Let G be a digraph. The vertex set and the arc set of G are de- 
noted by V{G) and A(G), respectively. The line digraph L(G) of G is defined as 
follows. The vertex set of L(G ) is the arc set of G, i.e. , V(L(G)) = A(G). 
The vertex (u,v) is a predecessor of every vertex of the form (v,w), i.e., 
A(L(G)) = {((w, v), (v, w)) | (u,v),(v,w) £ A(G)}. When we regard “L” as 
an operation on digraphs, the operation is called the line digraph operation. The 
k-iterated line digraph L k {G) of G is the digraph obtained from G by iteratively 
applying the line digraph operation k times. Iterated line digraphs have many 
desirable properties for interconnection networks of massively parallel computer 
such as bounded degree, small diameter, and high connectivity. In fact, the class 
of iterated line digraphs contains several well-known interconnection networks 
such as de Bruijn digraphs, Kautz digraphs, butterfly digraphs and wrapped 
butterfly digraphs. 
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We present upper and lower bounds on the queuenumber of an iterated line 
digraph L k (G). In particular, our upper bound depends only on G and is inde- 
pendent of the number of iterations k. As corollaries, upper and lower bounds 
on the queuenumbers of de Bruijn digraphs, Kautz digraphs, butterfly digraphs, 
and wrapped butterfly digraphs, some of which are generalizations of previously 
known results, are obtained. Also, it is shown that for any fixed digraph G, 
every digraph with n vertices in {L k (G) \ k > 1} has an 0(1) x 0(1) x 0(n ) 
three-dimensional straight-line grid drawing. 

This paper is organized as follows. In Section 2, we present upper and lower 
bounds on the queuenumber of L k (G). In Section 3, we apply the results to spe- 
cific families of iterated line digraphs. In Section 4, we consider three-dimensional 
drawings of iterated line digraphs. 



2 Queue Layouts of Iterated Line Digraphs 



We assume in this paper that a digraph may have loops but not multiple arcs, 
since L k {G) has no multiple arcs for all k > 1 even if G has multiple arcs. Let 
G be a digraph. If (x,y) £ A(G), then we say that £ is a predecessor of y , y 
is a successor of x, x is the tail of (x,y), and y is the head of ( x,y ). Two arcs 
such that the head of one arc is the tail of the other arc, are successive arcs. 
For v £ V(G), let Tq (v) denote the set of successors of v in G, and let A G (?;) 
denote the set of arcs with tail v in G. Also let (5 + (G) = min„ g y( G \ |A G (u)| and 
A+(G) = max 1 , 6y(G) |A£(u)|. Analogously, r G (u), A G (u), S~(G), and A~(G) 
are defined. 



2.1 An Upper Bound 

We first define a restricted queue layout, and then present an algorithm to con- 
struct such a restricted queue layout of L(G), based on a restricted queue layout 
of G, while preserving the number of queues. 

For a queue layout of a digraph and an arc e = (it, v) of the digraph, if 
<j{u) < a(v ) (resp. cr(v) < <7 (it)), then we say that e has right-direction (resp. 
left-direction). For a loop, we consider that it has no direction. We say that an 
arc with right-direction and an arc with left-direction have opposite directions. 
As a restricted queue layout, we define a tree-queue layout as follows. 

Definition 1. A tree-queue layout of a digraph G is a queue layout of G such 
that for arcs assigned to the same queue, the following two conditions hold. 

— any two arcs with the same head have opposite directions. 

— any successive arcs have the same direction except for a loop. 

The tree-queuenumber of G, denoted by tqn(G), is the minimum number of 
queues required for a tree-queue layout of G. 
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G: 





Fig. 1 . Algorithm for laying out L(G). 



Since a tree-queue layout is a restricted version of a queue layout, it holds that 
qn(G) < tqn(G). On the other hand, given a fc-queue layout, we can construct 
a tree-queue layout as follows. For each queue, we divide the set of arcs in the 
queue according to the direction, and assign arcs with the same direction to the 
same queue. A loop is assigned to either queue. Next, for each queue, we assign 
arcs with the same head in the queue to distinct queues. Thus, a layout using at 
most 2 kA~(G) queues is obtained. The resulting layout is a tree-queue layout. 
Therefore, tqn(G) < 2zi _ (G)qn(G). 

The terminology “tree-queue” named after the structural property that the 
set of arcs assigned to each queue induces a nearly disjoint union of rooted trees, 
where “nearly” means that two trees may have common leaves and the root of 
a tree may have a loop. 

We will show that the tree-queuenumber of G is an upper bound on the 
tree-queuenumber of L k (G). 

Theorem 1. tqn(L fe (G)) < tqn(G) for all k > 1. 

Proof. Given a tree-queue layout of G, we present an algorithm to construct a 
tree-queue layout of L(G) with the same number of queues. For convenience, we 
consider a line and regard the vertex ordering as a placement of vertices on the 
line. A vertex x is placed on the left of another vertex y iff er(x) < a(y). We 
write x <i y if a; is placed on the left of y, and x <i y if x is not placed on the 
right of y. Note that x may equal to y when x <i y. 

1. For each vertex u of G, we first regard all the tails of arcs in Aq(u) as being 
different and then stretch the point corresponding to u on the line so that 
any two arcs in Ajt(it) obey the first-in/first-out discipline. Then, we assign 
each arc in Aq(u) to the point of its tail (see Figure 1). More formally, the 
vertices of L(G) are placed on the line in the following way: 
a) If u <i w, then (u,a) <i (w,b) for any a £ Pq(u), b £ Pq(w). 
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b) Suppose that (u,v),(u,w) £ A(G). 

i. If v <i u <i w, then (u,v) <i ( u,w ). 

ii. If (it, it) £ A{G) and v <i it <i w, then (u,v) <i (it, it) <i ( u, tit ). 

iii. If u <i v <i w, then (it, it) <i ( u,w ). 

iv. If it; <; it <i it, then (u, w) <i (it, it). 

According to this placement of vertices on the line, we define the vertex 
ordering of L(G). 

2. For each arc ((u, it), ( v , w)) of L(G ), we assign it to the queue to which (u, it) 
is assigned in the tree-queue layout of G. 

In what follows, we show that the above algorithm correctly produces a 
tree-queue layout of L(G). We first show that any two arcs in each queue obey 
the first-in/first-out discipline. Let ((it, it), (it, wt)), ((x, y), (y, z)) £ A(L(G)) such 
that they are assigned to the same queue. This means that (it, v) and (a;, y) are in 
the same queue in the tree-queue layout of G. If it = x , then clearly ((it, it), (it, w)) 
and ((x, y), (y, z)) obey the first-in/first-out discipline, by our algorithm. Then 
suppose that u ^ x. Without loss of generality, we can assume that u <i x. Thus, 
(it, v) <i ( x,y ) (by l-(a) in the algorithm). Now we assume that ((it, n) , (n, iti)) 
and ((x, y), (y, z)) do not obey the first-in/first-out discipline, i.e., one of the 
following inequalities holds: 

Case 1: (it,i;) < ; (x,y) <i ( y,z ) <i (v,w) 

Case 2: (y,z) <i (v,w) <i (it, n) <i (x,y) 

Case 3: (it, u) <; (y,z) <i (x,y) <i (v,w) 

Case 4: (y,z) <i (it, i;) <i (v,w) <i (x,y) 

Assume that Case 1 holds. By our algorithm, we can see that 1 1 <i x <i y <i v. 
Since two arcs with the same head must have opposite directions, y ^ v. Hence 
u <i x <i y <i v. However, this contradicts the first-in/first-out discipline. 
Therefore, Case 1 does not hold. Similarly, Case 2 does not hold. Next, consider 
Case 3, and assume that it holds. Then we obtain that u <i y <i x <i v. 
If it ^ y and x ^ v, then (it, i;) and (x,y) contradicts the first-in/first-out 
discipline. Thus, it must be that it = y or x = v. Suppose that x = y, i.e., (x, y) 
is a loop. If x = v, then (x,y) and (it, v) have the same head but not opposite 
directions. Thus, it = y. However, by our algorithm (l-(b)-(ii)), (x,y) must be 
placed on the left of (u,v), which contradicts the situation in Case 3. Suppose 
that x ^ y, i.e., (x, y) is not a loop. Then (it, v) and (x, y) are successive arcs with 
opposite directions, which contradicts a tree-queue layout of G. Hence, Case 3 
does not hold. It can be similarly shown that Case 4 does not hold. Therefore, 
our algorithm correctly produces a queue layout of L(G). 

Next, we show that the queue layout of L(G) is a tree-queue layout. Sup- 
pose that ((a,u),(u,v)),((b,u),(u,v)) £ A(L(G)) such that these two arcs are 
assigned to the same queue. From our algorithm, (a, it) and (6, it) must be in 
the same queue in the tree-queue layout of G such that their directions are 
opposite. Note that neither (a, it) nor (6, it) is a loop. Since the directions of 
((a, it), (it, it)) and ((6, it), (it, v)) are the same as those of (a, it) and (b, it), re- 
spectively, ((a, it), (it, it)) and ((&, it), (it, i;)) have opposite directions. 
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Next suppose that ((u,v), (v,w)) and ((v,w), (w,x)) are in the same queue. 
Then (u, v) and (v, w) are in the same queue in the tree-queue layout of G. 
Thus, (u,v) and (v,w) have the same direction, or (u,v) is a loop. If (u,v) 
is not a loop, then ((u,v),(v,w)) and ((v, w), (w, x)) have the same direction. 
Suppose that (u,v) is a loop. Without loss of generality, we can assume that 
u = v <i w. Then by our algorithm (l-(b)-(ii)), (u,v) <i (v,w) <i (w,x). 
Therefore, ((u,v), (v,w)) and ((v,w), (w,x)) have the same direction. 

Consequently, the layout of L(G) obtained by our algorithm is a tree-queue 
layout with the same number of queues in the tree-layout of G. By applying the 
algorithm iteratively, we have tqn(L fe (G)) < tqn(L fe_1 (G)) < ■ ■ • < tqn(G). □ 

Corollary 1. qn(L fc (G)) < tqn(G) for all k > 1. 

It can be easily checked that S~(G) < tqn(G) < |V(G)|. (Thus, for the com- 
plete digraph K ° , it holds that tqn (K°) = n.) Therefore, we have the following 
corollary. 

Corollary 2. qn(L fc (G)) < |V(G)| for all k> 1. 



2.2 A Lower Bound 

It has been shown in [21] that in a queue layout of a graph with n vertices, one 
queue has at most 2n — 3 edges. Thus, a general lower bound on the queuenumber 
of a graph is obtained as follows. 

Theorem 2. [21] Let G be a graph with n vertices. Then qn(G) > 

The number of arcs in L k (G) cannot be expressed in a simple form in general. 
Besides, we need to consider not only the number of arcs, but also the numbers 
of loops and cycles of length two (symmetric arcs) in a digraph in order to 
count edges in the underlying graph. Then, we use a structural property of a 
line digraph. 

For a vertex v with no loop in G, L(G) has a complete bipartite digraph 
corresponding to v, since Aq(v)U Aq(v) forms a complete bipartite digraph with 
partite sets of size | Aq(v)\ and |Ag(n)| in L(G). The queuenumber of a complete 
bipartite graph K m ^ n with partite sets of size m and n has been determined in 
[21] to be min{|"f ], [§]}, 

If G has only loops, then A~(G) = A + {G) = 1. On the other hand, if G has 
an arc which is not a loop and max{<5“(G), 5 + (G)} > 0, then there is a vertex 
with no loop in L k (G) for all k > 1. Now suppose that G has an arc which 
is not a loop and max{<5“(G), <5 + (G)} > 0. Let v be a vertex with no loop in 
L k (G). It can be easily checked that each vertex in L k (G ) corresponds to a walk 
of length k in G. Suppose that v corresponds to a walk (vi,V 2 ,. .. ,Vk+i) in G. 
Since |A“ fc(G) (u)| = |A G (m)| and |A+ fc(G) (i;)| = |A^(u fc+ i)|, L k+1 (G) contains 

a complete bipartite digraph with partite sets of size |A G (m)| and |A G (ufc + i)|. 

Thus, qn(L fe+1 (G)) > minj 1 ^ 0 ' , |A o^+ l)| .}. Therefore, the following theo- 
rem holds. 



|g(G)| 
2n— 3 
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Theorem 3. qn(L fe (G)) > min{ |~ - ^ for all k > 2. 

Suppose that G has a loop. Let diam(G) denote the diameter of G. (When G 
is not strongly connected, we define diam(G) as oo.) Then for all k > 2diam(G), 
there is a walk of length k from a vertex with indegree A~(G) to a vertex with 
outdegree A + (G) passing through a loop. Thus, the following theorem holds. 
(Note that it trivially holds for the case that G has only loops, and the case that 
G is not strongly connected.) 

Theorem 4. Let G be a digraph with a loop. Then, 

qn (L k (G)) > rnin{ [ A j G ^ ], \ A for all k > 2diam(G). 

3 Queue Layouts for Specific Classes 

3.1 De Bruijn and Kautz Digraphs 

The de Bruijn digraph B{d,D) can be defined as L D_1 (A))) where is the 
complete digraph with d vertices [11]. The Kautz digraph K(d,D ) can be also 
defined as L d ~ 1 ( k K^, 1 ), where K* l+[ is the complete symmetric digraph with d+ 
1 vertices. These digraphs are representative interconnection networks in iterated 
line digraphs, and have many nice properties (see [3]). As a direct consequence 
of Corollary 2 and Theorem 3, upper and lower bounds on qn {B{d,Dj) and 
qn(A"(d, D)) are obtained. (These results can be also obtained as corollaries of a 
result on the queuenumbers of generalized de Bruijn and Kautz digraphs [15].) 

Proposition 1. |~|"| < qn (B(d,D)) < d (D > 3). 

Proposition 2. [|] < qn(K(d, D)) < d + 1 (D > 2). 

3.2 Butterfly Digraphs 

The butterfly graph is one of the most popular interconnection networks. The 
k-ary butterfly digraph b(k,r) is a directed version of the k- ary butterfly graph, 
and can be defined as an iterated line digraph. As an origin digraph to which 
the line digraph operation is applied, there are two digraphs. 

The Kronecker product of two digraphs G i and G 2 , denoted by Gi ® G 2 , is 
defined as follows: 

f V(G\ 0 G 2 ) = V(Gi) x V(G 2 ), 

\ A(Gi ® G 2 ) = {((ui, u 2 ), (ui, v 2 )) | («i, ui) G A(Gi) and (u 2 , v 2 ) G A(G 2 )}. 

Then it holds that b(k. r) = L r ~ 1 (K % ® P 2 r), where P 2r is a directed path with 
2r vertices [2]. 

A complete k-ary out-tree is an out-tree such that every non-leaf vertex has 
outdegree k, and every path from the root to a leaf has the same length. A 
complete k-ary in-tree is an in-tree obtained from a complete k-ary out-tree by 
reversing the orientations of arcs. Let X(k,r ) denote the digraph obtained from 
the complete k-ary in-tree of depth r and the complete fc-ary out-tree of depth 
r by identifying their roots. Then it holds that b(k,r) = L r (X(k,r)) [17]. 
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Fig. 2. 6(3, 2), A 3 ° ® Pi, and X{3, 2). 



Propositions. |~|] < qn(6(fc, r)) < min {fc, [|J + 2}. 

Proof. The fc-ary butterfly graph contains a complete bipartite graph Kk,k- 
Thus, |"|] is a lower bound on qn(6(fc,r)). For an upper bound, we first con- 
sider a tree-queue layout of K k C§> P 2r . Let V (K k ) = {vi,v 2 ,... ,Vk} and 
V(P 2r ) = {wi,w 2 , ■ ■ ■ ,w 2r } such that (wj,Wj + i) £ A(P 2r ) for 1 < i < 2 r. 
Then V(K k ® P 2r ) = {( Vi,Wj ) | 1 < i < fc, 1 < j < 2 r}. As a vertex-ordering 
of K k ® P 2r , we employ the lexicographical ordering with respect to the indices 
j and i of ( Vi,Wj ). (In the following, we omit the notation a for the vertex- 
ordering.) 

Oi, Wi) < ■■■ < (v k , Wl) < (l>l, w 2 ) < ■ ■ ■ < (v k -l,W 2r ) < ( Vk , W 2r ). 

Let A t = {((vt, w p ), ( Vi , Wp+i)) | 1 < i < k,l < p < 2 r} for t = 1, 2, . . . , k. We 
prepare k queues and assign the arcs in A t into the t-th queue. It can be easily 
checked that this layout is a tree-queue layout. 

Next we consider a tree-queue layout of X{k,r). It is not difficult to see 
that the complete k - ary out-tree of depth r has a 1-tree-queue layout. For the 
complete k - ary in-tree of depth r , we prove by induction on the depth that it 
has a (L|J + l)-tree-queue layout. Let T(k,r ) denote the complete fc-ary in- 
tree of depth r. Also, let V(T(k,r)) = {(*,j) | 1 < i < , 0 < j < r}, where 

j denotes the depth of a vertex (i,j). Note that (1,0) is the root of T(k,r). 
Suppose that r = 1. We order the vertices as follows. (1,1) < • • • < (L|J , 1) < 
(1,0) < (L|J + 1,1) < ••• < (fc,l)- Then we assign arcs ((*, 1), (1, 0)) and 
((L|J + i, 1), (1, 0)) to the i-th queue (1 < i < L|J)- When k is odd, we assign 
the arc ((fc, 1), (1, 0)) to the ( [f J +l)-th queue. Clearly, this layout is a tree-queue 
layout. Now assume that T(k, r) has a tree-queue layout using [f J + 1 queues. 
By adding k arcs to each leaf of T(k,r), T(k,r + 1) is obtained. For each leaf 
of T(k,r), we do the similar manipulation to the case of r = 1. Let v be a leaf. 
Also let (v, w ) be the arc with tail v. About half of r+1 , (v) is placed on the 
left of v, and the remaining half is placed on the right of v. Then we assign two 
arcs with opposite directions in Alf, k r+1 \(u) to each queue. Here, if k is even, 
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then we assign no arc to the queue to which (v, w) is assigned. If k is odd, then 
we assign one arc with the same direction as (v, w) to the queue to which (v, w) 
is assigned, and use the other queues for the remaining arcs. (Thus, if (v, w) has 
right (resp. left) direction, then |_fj + 1 vertices in r~ (k r+] ) ('(-’) are placed on 
the left (resp. right) of v.) After such an assignment of arcs, we precisely set the 
order position of each tail while preserving the tree-queue layout style. Such a 
setting is always possible, since the underlying graph of a digraph induced by the 
set of arcs assigned to each queue is a disjoint union of paths such that for any 
vertex x with degree two in the graph, one neighbor is on the left of x and the 
other is on the right of x. Therefore, T(k , r) has a tree-queue layout with [|J + 1 
queues, and thus X(k,r) has a tree-queue layout with [f J + 2 queues. □ 

3.3 Wrapped Butterfly Digraphs 

The fc-ary wrapped butterfly digraph wb(k,r), r > 3, can be defined as 
L r ~ 1 (K k ® CV), where C r is a directed cycle of length r [2]. 

Proposition 4. I’ll < qn(wb(k,r)) < 2k. 

Proof. Similarly to b(k,r). is a lower bound on qn (wb(k,r)). Let V(K%) = 
{v\,v 2 ,... ,Vk} and V(C r ) = {w\,w 2 ,... ,w r } such that (wi,w i+ 1 ) G A(C 2r ) 
for 1 < i < r and (w r ,w 1 ) G A(C 2r ). Similarly to the first case of butterfly 
digraphs, we order the vertices, and assign the arcs into k queue, except for the 
arcs ((vi,w r ), (Vj,wi)), 1 < i,j < k. We prepare other k queues and assign the 
arcs in {((u t , uy), (vj,w\)) | 1 < j < k} into the (fc + t)-th queue. We can easily 
check that the resulting layout is a tree-queue layout. □ 

4 Three-Dimensional Drawings of Iterated Line Digraphs 

A three-dimensional drawing treated here is a three-dimensional straight-line 
grid drawing of a graph. The vertices are represented by distinct points in Z 3 . 
Each edge are represented by a line-segment between its end- vertices such that 
edges only intersect at common end- vertices, and an edge only intersects a vertex 
which is an end-vertex of the edge. If a three-dimensional drawing is contained 
in an axis-aligned box with side length X — 1, Y — 1, Z — 1, then the drawing 
is called an X x Y x Z drawing with volume X ■ Y ■ Z. A three-dimensional 
straight-line grid drawing of a digraph is similarly defined. 

Dujmovic et al. [8] introduced a track layout. A k-track assignment of a 
graph G consists of a partition Vi, V 2 , . . . ■ 14 of V(G) such that each Vi is an 
independent set (i.e. , if (it, v} G E(G) such that u G Vi and v G V), then i ^ j ) 
and a linear ordering eq of each V). Each pair of Vi and cr,; is called a track. An 
X-crossing in a track assignment consists of two edges {i>,w} and {x,y} such 
that v : x G Vi and w,y G Vj, where i ^ j, and <Ji{v) < cri(x), crj(y) < 

A fc-track assignment with no X-crossing is called a k-track layout. The track- 
number of G, denoted by tn(G), is the minimum number of tracks required for 
a track layout of G. Dujmovic et al. proved the following. 
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Theorem 5. [8] If G has a k-track layout, then G has a k x 2k x 2 kn' three- 
dimensional drawing , where n' is the maximum number of vertices in a track. 

The star chromatic number of a graph is the minimum number of colors re- 
quired for a vertex-coloring of the graph such that each bichromatic subgraph is a 
disjoint union of stars. Based on the star chromatic number and the queuenum- 
ber, Wood [31] presented an upper bound on the track-number. (The upper 
bound has been improved in [7].) 

Theorem 6. [7] Let G be a graph with star chromatic number Xst(G) < c, and 
queuenumber qn(G) < q. Then tn(G) < c(2 q + l) c_1 . 

In Section 2, we have shown that for any fixed digraph G, the queuenumber 
of every L k {G) is at most the tree-queuenumber of G. Also the underlying graph 
of L k {G) has bounded star chromatic number, since the maximum outdegree 
and indegree of L k (G) are equal to those of G, respectively, and it has been 
shown that graphs with bounded maximum degree have bounded star chromatic 
number [1,10]. Thus, we have the following theorem. 

Theorem 7. For any fixed digraph G, every iterated line digraph of G has an 
0(1) x 0(1) x O(n) three-dimensional drawing. 



Corollary 3. For any fixed d > 2, B{d , D) and K(d , D) have three-dimensional 
drawings with 0(n ) volume. 

For b(k,r) and wb(k,r), the origin digraphs to which the line digraph op- 
eration is applied, depend on r. However, their maximum degree and upper 
bounds on the queuenumber depend only on k. Thus, we also have the following 
corollary. 

Corollary 4. For any fixed k > 2, b{k,r) and wb(k,r) have three-dimensional 
drawings with O(n) volume. 

For B(2,D) and &(2,r), their three-dimensional drawings with O(n) volume 
was previously shown in [31]. 

We can show that the number of vertices in G is an upper bound on the 
track-number of L k (G ), if we restrict ourselves to digraphs with no loop and no 
symmetric arcs. 

Theorem 8. Let G be a digraph with no loop and no symmetric arcs. Then 
tn (L k (G)) < |V(G)| for all k> 1. 

Proof. Let V(G) = {vi,V 2 , ■ ■ ■ ,v p }. We order the vertices of G in any manner. 
Then we prepare p queues, and assign the arcs in A^Uj) to the i-tli queue. This 
trivial queue layout is indeed a tree-queue layout. Also this layout corresponds 
to a p-track layout by considering that each track V) consists of only Vi. 

By applying the algorithm in Theorem 1, we can obtain a tree-queue lay- 
out of L(G) using the same number of queues. We show that this layout also 
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corresponds to a p-track layout, where each track V) = Ag(uj) and the linear 
ordering of each Vi follows the vertex ordering of the queue layout. Since G has 
no loop, there is no arc between the vertices in each track Vj. Thus, the queue 
layout corresponds to a p-track assignment. Since G has no symmetric arcs, for 
any V % and Vj (i ^ j), all arcs between Vi and Vj are in the same queue. Also 
all vertices in each track are consecutive with respect to the vertex ordering cr 
of the queue layout of L(G), i.e. , if for some u £ V. and v £ Vj, er(u) < cr(v), 
then for every x £ V t and y £ Vj, er(x) < cr(y). Therefore, there is no X-crossing 
in the track assignment. 

Similarly, we can see that the tree-queue layout of L k ( G ) obtained by apply- 
ing the algorithm iteratively, corresponds to a p-track layout, where each track 
Vi consists of the vertices corresponding to walks of length k starting from v* 
in G, and the linear ordering of V. follows the vertex ordering of the tree-queue 
layout of L k (G). □ 



Corollary 5. Let G be a digraph with no loop and no symmetric arcs. Also let 
p = |V(G)|. Then L k {G) has a p x 2p x 2 pn' three-dimensional drawing, where 
n! is the maximum number of vertices in a track. 
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Abstract. A fc-track drawing is a crossing- free 3 D straight-line drawing 
of a graph G on a set of k parallel lines called tracks. The minimum value 
of k for which G admits a fc-track drawing is called the track number of G. 
In [9] it is proved that every graph from a proper minor closed family has 
constant track number if and only if it has constant queue number. In this 
paper we study the track number of well-known families of graphs with 
small queue number. For these families we show upper bounds and lower 
bounds on the track number that significantly improve previous results 
in the literature. Linear time algorithms that compute track drawings of 
these graphs are also presented and their volume complexity is discussed. 



1 Introduction and Overview 

The problem of computing a drawing of a graph with small area/volume has 
received a lot of attention in the graph drawing literature during the last decade. 

Felsner et al. [5] initiated the study of integer grids consisting of parallel 
grid lines, called tracks. In particular, they focus on the box and the 3-prism. 
A box is a grid consisting of four parallel lines, one grid unit apart from each 
other and a 3-prism uses three non-coplanar parallel lines. It is shown that all 
outerplanar graphs can be drawn on a 3-prism where the length of the lines is 
0(n). This result gives the first algorithm to compute a crossing-free straight- 
line 3D drawing with linear volume for a non-trivial family of planar graphs. 
Moreover it is shown that there exist planar graphs that cannot be drawn on 
the prism and that even a box does not support all planar graphs. 

Dujmovic et al. [2] show that if a graph G admits a drawing T on a a grid 
c t> consisting of a constant number of parallel lines, then G has a linear volume 
upper bound. This result suggests that the focus of the research should be on 
minimizing the number of tracks in a restricted integer grid, independent of the 
length of the tracks themselves. The track number tn(G) of a graph G is the 
minimum number of tracks that is required to compute such a drawing. 

* Research partially supported by “Progetto ALINWEB: Algoritmica per Internet e 
per il Web”, MIUR - PRIN. We thank Giuseppe Liotta for useful discussions on the 
subject of this paper. 
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Wood [9] shows a relationship between tn(G) and another well-studied graph 
parameter, the queue number qn(G) (i.e. the minimum number of queues in a 
queue layout of G [8]). He proves that every graph from a proper minor closed 
family has constant track number if and only if it has constant queue number. 
By the result of Wood all families of graphs whose queue number is known to 
be constant (for example series-parallel graphs, Halin graphs, Benes networks, 
arched leveled planar graphs, X-trees, unicyclic graphs), have a three dimensional 
straight-line grid drawing with linear volume. A recent result by Dujmovic and 
Wood [4] shows that linear volume can also be achieved for graphs with bounded 
tree width. We observe however that value of the track number (and hence of the 
volume) deriving from the results above are often very large. In [3] it is shown 
that tn(G ) < c(2qn{G ) + l) c_1 where c is the star chromatic number of G ( 
c > 3 for any graph). Therefore, even for the (apparently innocent) family of 
planar graphs whose queue number is 1 we obtain an upper bound for the track 
number of at least 27 and by using the technique in [2] a drawing with a bound 
on the volume of 54 x 59 x 59 . From the observation above it is natural to 

ask whether it is possible to reduce the bounds on track number and volume for 
graphs with constant queue number. 

In this paper we present new lower and upper bounds on the track number 
(and hence on the volume) of some families of graphs that are known to have 
constant queue number. Our main contributions can be listed as follows. 

— The family of graphs with queue number one (i.e. arched leveled planar 
graphs) are proved to have track number at least four and at most five. A 
drawing algorithm is presented for arched leveled planar graphs that gives a 
volume bound of 3 x 3 x n. 

— The track number of X-trees (that have queue number 2) is proved to be 
three and a volume bound of 2 x 2 x 4| ~ K 7 l ~ 1 - ) . 

— A lower bound of three and an upper bound of four is presented for Halin 
Graphs. A volume bound of 2 x 2 x n is also shown. 

The drawing algorithm described in order to prove the above results all have 
O(n) time complexity and use integer arithmetic. 

The remainder of of the paper is organized as follows. In Section 2 some 
definitions and results about track and queue layout are recalled. The bounds 
on the track number and on the volume of arched leveled planar graphs are 
proved in Section 3. The track number of X-trees and Halin Graphs is studied 
in Section 4. For reasons of space some detail are omitted. 

2 Preliminaries 

Let G = (V. E) be a graph. A track assignment of G consists of a partition 
{ti | i S I C N} of H, and of a total ordering <; of the vertices in each set 
ti . Each set ti is called a track-set. An overlap in a track assignment consists of 
three vertices u, v , and w such that they are in the same track-set t t , there exists 
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the edge (u, w ) and u <i v <* w. An X-crossing in a track assignment consists 
of two edges (uo,vq) and (ui,Vi) such that uq and u\ are on the same track-set 
tj, Vo and Vi are on another track-set tj ( i ^ j) with u o <i U\ and Vi <j Vq- 
Figure 1 (a) shows an example of track assignment for a graph G. Vertices Vi, v$ 
and V2 form an overlap, as well as vertices V3, vq and V4. Edges (175,14), (^2,^3) 
form an X-crossing. Another X-crossing is formed by edges (vq, vg) and (174, V7). 

A track layout is a track assignment with no overlaps and no X-crossings. 
Figure 1 (b) shows an example of a track layout of the graph of Figure 1 (a). A 
track layout with k track-sets is also called a /c-track layout. The track number 
of a graph G, denoted by tn(G), is the minimum k such that G has a /c-track 
layout 1 . A set of k track-sets is also called a k-prism. 

In the rest of the paper a track layout will be specified by assigning to each 
vertex v two numbers: track{v) is an integer that denotes the index of the track- 
set containing w; order(v ) is an integer that denotes the ordering of v in track{v). 
We say that u <i v if track(u ) = track{y ) = i and order(u ) < order{y). We 
shall sometimes simplify the notation and write u < v instead of u <j v. 

A track is a set of 3 D grid points on straight line of infinite length. We always 
assume that a track is parallel to the x-axis, thus a track is the set of all the grid 
points having the same y- and ^-coordinate. We denote as (x, Y, Z) a track whose 
points have y-coordinate Y and ^-coordinate Z . A track drawing of a graph G 
on k tracks, also called /c-track drawing, is a 3 D straight-line crossing- free grid 
drawing of G such that each vertex of G is drawn on one of k tracks. The drawing 
in Figure 1 (c) is a track drawing of the graph of Figure 1 (a). The k tracks of a 
fc-track drawing will be denoted as To, Ti, . . . , T^-i- A strip atj is the portion of 
plane delimited by tracks Ti and Tj. A strip is assumed to be an open set. Two 
strips < Tij and a hi are called crossing strips if they cross each other. If a drawing 
of a graph is contained in an axis-aligned box with side lengths X — 1 , Y — 1 
and Z — 1 then we say that the volume of the drawing is X x Y x Z. 










"^y 




(a) 



(b) 



(c) 



Fig. 1. (a) A track assignment of a graph G. (c) A track layout of G. (d) A track 
drawing of G. 



1 The concepts of track assignment and track layout are introduced also in [2] , where 
it is assumed that no edge has both vertices in a same track-set. 
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In [2] a general technique is described for computing a track drawing of a 
graph G from a track layout of G. In order to reduce the volume of the drawing 
we use a different technique to obtain a track drawing from a track layout. 
The obtained drawing have neither overlaps nor X-crossings, because there is 
no overlap and no X-crossing in a track layout. The only possible crossings are 
crossings between two edges on a pair of crossing strips. Let eg = (ug,vg) and 
ei = (ui,v\) be two edges of G on two crossing strips and a hi- If eg and ei 
cross each other, then the four points representing ug, vg, U\ and iq are co-planar, 
i.e. the following equation is satisfied: 



1111 

x{ug) x(u{) x(vg) z(l>i) 
y(ug) y{u i) y(v o) y(v i) 
~(«o) z(ui) z(v o) z(v- l) 



= 0 



The substitution of the y- and ^-coordinates of each vertex in the equation above 
gives a condition on the ^-coordinates of the vertices that must be satisfied in 
order to have a crossings between eg and e\. Thus it is sufficient to prove that 
the equation has no solution in order to prove that eg and ei do not cross each 
other. We call this equation co-planarity equation of eg and e\. 

A queue layout [8] of a graph G consists of a linear ordering A of the vertices 
of G, and a partition of the edges of G into queues, such that no two edges in 
the same queue are nested with respect to A. In other words there are no edges 
eg = (ug,vg) and e\ = (iti,ui) such that eg and ei are assigned to the same 
queue and ug < u\ < Vi < vg in A. A queue layout with q queues is also called 
a q-queue layout. The queue number of a graph G, denoted by qn(G), is the 
minimum q such that G has a g-queue layout. 

The following theorem [3] relates the track number of a graph to the queue 
number of a graph. 



Theorem 1. [3] Let G be a graph with star chromatic number y st (G) < c, and 
queue number qn(G) < q. Then G has a t-track layout, where t < c(2q + l) c_1 . 



3 Graphs with qn = 1 

Graphs with queue number equal to 1 are characterized in [8] , where it is shown 
that they are planar graphs and admit a leveled planar embedding. For this 
reason they are called arched leveled planar graphs. We first give a lower bound 
on the track number and then we present an upper bound that improves the 
result in [9]. We start with a basic lemma that is used to prove the lower bound. 

Lemma 1 . Let Gg be the graph in Figure 2(a). Then in any 3-track layout , 
vertices u and v are in different track-sets, and at least one of the vertices wt 
(i = 0, ... ,3) is on the third track-set. 

Lemma 2. Let G be the graph in Figure 2(b). Then qn(G) = 1 and tn(G ) > 4. 
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(a) 



(b) 



Fig. 2. (a) Graph Go of Lemma 1. (b) Graph G with qn(G) = 1 and tn(G) = 4. 



Proof. The proof that G has queue number 1 follows from Figure 2(b) where 
a 1-queue layout of G is given. We prove that it does not admit a track layout 
on 3 track-sets. Suppose for a contradiction that there exists a 3-track layout 
of G with track-sets to, t\ and t 2 , and let to be the track-set containing u. By 
Lemma 1 vertices Vi (0 < i < 5) must be in a track-set different from to- Let t\ 
be the track-set containing V 5 . Two cases are possible: 

1. There exist three distinct vertices Vi, Vj and Vk (0 < i,j,k < 4) that are in 

t\. In this case two of the three edges (v 5 ,Vj), and (vs,Vk) form an 

overlap. 

2. There exist three distinct vertices Vi , Vj and Vk, that are in i 2 . By Lemma 1, 
there exist three vertices Wi ta , Wj,b and Wk, c (0 < a, b, c < 3) in ti adjacent 
to u and to Vi, Vj and Vk respectively. Assume without loss of generality 
that Vi < Vj < Vk in f 2 . It follows that Wi^ a < Wj ^ < Wk, c in ti> because 
otherwise there would be an A-crossing between edges (vi,Wi^ a ), {vj,Wj t b) 
and ( Vk,Wk, c )■ If ^5 < »i,a then edges (05 ,vj) and ( Vi,Wi <a ) form an X- 
crossing. If w, )a < v$ < Wj.b then edges (r> 5 ,Vk) and ( Vj,Wj t b ) form an X- 
crossing. If Wj t b < v$ < Wk, c then edges (i> 5 ,Vi) and (vj,Wj t b) form an X- 
crossing. Finally, if v$ > Wk, c then edges (v$,Vj) and (vk,Wk, c ) form an 
A-crossing. 

It follows that tn(G ) >4. □ 

In order to prove the upper bound on the track number of arched leveled 
planar graphs we first describe a block decomposition of a connected graph G 
with qn(G) = 1. Then we will describe how to assign the vertices to 5 track-sets. 
Suppose we have a 1-queue layout of G where the linear ordering of the vertices 
is A = vo,Vi,... ,v n _i . We say that n, < Vj if i < j. A vertex v is called a 
special cut vertex if there are no edges ( u , w) with u < v < w. A block is a 
subset of consecutive vertices in A. The first vertex of a block Bi is called source 
vertex and is denoted as Si. The last vertex of a block Bi is called sink vertex 
and is denoted as t, . All the other vertices are called internal vertices of Bi. The 
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block decomposition is defined as follows: (i) Block Bo consists of the vertices 
Vq,Vi , ,Vj, where Vj is such that there exists the edge (vo, Vj) and there is no 
edge ( vo,Vh ) with Vj <Vh- In other words j is the largest index such that there 
is an edge ( Vo,Vj ). (ii) If the sink vertex tj_i of block _B,_ i is not a special cut 
vertex then block Bi consists of the vertices Vk,Vk+i, ■ ■ ■ ,Vj with the following 
properties: there is an edge (■ Vk,Vj ); Vk < U- 1 ; there is no edge (v g ,Vh) with 
v g < ti - 1 and Vj < Vh\ there is no edge ( Vh,Vj ) with Vh < Vk- In other words j 
is the largest index such that there is an edge from a vertex smaller than to 
Vj and k is the smallest index such that there is an edge (vk, Vj). (iii) If the sink 
vertex ti- \ of block Bi-i is a special cut vertex then block Bi consists of the 
vertices Vk, Vk+i, ■ ■ ■ , Vj, where Vk = t*_i, and where Vj is such that there exists 
an edge u,) and there is no edge (ti—i,Vh) with h > j. In other words j is 

the largest index such that there is an edge from tj_i to Vj. 

An example of a block decomposition is illustrated in Figure 3. 




h r " - /ig - - -- r | . V ft r k — H'2 y /f ; — 



' Bi -f 'ir—fBs:. — 4- B s — 4 



Fig. 3. A block decomposition of an arched leveled planar graph. 



Algorithm 1 computes a 5-track layout of an arched leveled planar graph G. 



ALPTrackLayout(G) 

Input: 1-queue layout of connected graph G with A = 

Vo, Vl, . . . , Vn—l 

Output: A 5-track layout of G. 

Let to,ti,t 2 ,t 3 and £4 be five track-sets; 

Let Bo, . . . , Bk- 1 be a block decomposition of G; 
track(so) 0; 
for i = 0 to k — 1 
t <— track(si); 

foreach internal vertex v of Bi not assigned to any track-set 
track(v) <— (t + 1) mod 5; 
track(ti) «— (t + 2) mod 5; 
for i = 0 to n — 1 
order{vi) = i; 

Algorithm 1: Algorithm ALPTrackLayoutQ 
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Lemma 3. Let G be a connected graph with qn(G) = 1. Let A = Vq, v\, . . . , v n -\ 
be the linear ordering of a 1-queue layout of G. The track assignment computed 
by Algorithm 1 is such that for all edges ( v g ,Vh ) with g < h, either track{vh ) = 
( track(v g ) + 1) mod 5 or track(vh) = ( track(v g ) + 2) mod 5. 

Lemma 4. Let G be a graph with qn(G) = 1, then tn(G ) < 5. 

Proof. We first assume that G is connected. Suppose we have a 1-queue layout 
of G with A = vq,vi, . . . ,v n -\. We prove that Algorithm 1 computes a track 
layout of G. We have no overlaps since there are no edges ( Vi , Vj) with track{vf) = 
track(vj). Also we have no A-crossing. Consider two edges (v g ,Vh) and ( Vi,Vj ) 
such that v g and Vi are in a track-set a and Vh and Vj are in another track-set b. 

By Lemma 3, there are two cases: (i) if b = (a + 1) mod 5 or b = (a + 2) mod 5, 

then v g < Vh and Vi < Vj\ (ii) if b = (a — 1) mod 5 or b = (a — 2) mod 5, then 
v g > Vh and Vi > Vj. Consider case (i), and assume without loss of generality 
that v g < Vi . If Vj < Vh then v g < Vi < Vj < Vh in A, but this is not possible, 
because there would be two nested edges in the 1-queue layout. Hence Vh < v 3 
and therefore there is no A-crossing. With analogous arguments also case (ii) 
can be considered. □ 

Lemma 4 proves that every arched leveled planar graph G has track number 
at most 5. The following theorem summarize the results above and describe 
how to produce a 5-track drawing with volume 3x3 x n. 

Theorem 2. Every arched leveled planar graph has track number at most 5 and 
admits a 5-track drawing with volume at most 3x3 x n that can be computed 
in 0(n) time. Also, there exists an arched leveled planar graph G such that 
tn(G ) > 4. 

Proof. The bounds on the track number follow from Lemma 2 and 4. We describe 
now how to compute a track drawing with volume 3x3 x n. Consider the 
five tracks (x,2, 1), (a;, 0,1), (a;, 0,0), (a;, 2,0) and (x, 1,2) and denote them as 
T 0 ,Ti,T 2 ,T 3 , and X4, respectively. Compute a track layout with five track-sets 
using Algorithm 1. Let no, n\, n 2 , n 3 and n 4 be the number of vertices in track- 
set t 0 , t\, t 2 , to and t±, respectively. Draw the vertices assigned to track-set t. L 
on track Ti according to the total order defined in the track-set, so that they 
occupy ^-coordinates from Y^j=o n i Sj=o n o (* = 0, 1, 2, 3, 4). We prove that 
the drawing has no crossing. Overlaps and A-crossings are not possible, because 
there is no overlap and no A-crossing in the track layout. 

A crossing is possible only between edges that are on crossing strips. There 
are five pairs of such strips: (1) 002 and cri 3 ; (2) 001 and <T24;(3) cr 0 i and cr 3 4; 
(4) fjp2 and <t 3 4;(5) o i 3 and er 2 4. Let er^ and ahi be the two crossing strips 
of one of the five cases above and let ei and e2 be two edges on <7^ and ahi, 
respectively. Denote as Xj, Xj, Xh and Xi the x-coordinates of the vertices on 
tracks T), Tj, Th and T/. The co-planarity equations for each of the five cases 
are: (1) Xo + x 2 = Xi + x 3 (2) xq + 3xi = 2x 2 + 2x4 (3) 3xo + Xi = 2x 3 + 2x4 (4) 
4xo + x 2 = 3x 3 + 2x4 (5) 4xi + x 3 = 3x2 + 2x4. Since Xo < xi < X2 < x 3 < X4, 
none of the above equations has a solution, and therefore no crossing is possible. 
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The drawing algorithm consider a vertex per time and executes a constant 
number of operations for each vertex. The time complexity is then O(n). □ 

Theorem 2 shows that every arched leveled planar graph admits a 5-track 
layout. However, there are specific classes of graphs that have queue number 1 
and that admit a track layout with less than 5 track-sets. One of such class is 
that of square meshes. The following Theorem hold. Proof is omitted for reasons 
of space. 

Theorem 3. Every square mesh has track number at most 3 and admits a 3- 
track drawing with volume at most 2 x 2 x can & e computed in 0(n) 

time. Also, there exists a square mesh G such that tn(G) > 3. 

4 Graphs with 2 < qn < 3 

In this section we study the track number of graphs with queue number equal to 
2 or queue number greater than 2 and less than 3. X-trees have queue number 
2 [8]. The following theorem, whose proof is omitted for reasons of space, hold. 

Theorem 4. Every X-tree has track number at most 3 and admits a 3-track 
drawing with volume at most 2 x 2 x |(n + l) that can be computed in 0(n) time. 
Also, there exists an X-tree G such that tn{G) > 3. 

Halin Graphs [7] have queue number greater than 2 and less than 3 [6]. A 
Halin graph is a graph such that: (i) every vertex of G has degree greater or 
equal to 3; (ii) G can be decomposed into a spanning tree T of G and a cycle 
C through the leaves of T; (iii) G has a planar embedding in which C is the 
boundary of the external face. 

T is called the characteristic tree of G and C is called the adjoint cycle of G. 
Figure 4 shows a Halin graph. 




It is known that a graph admits a 2-track layout only if it is outerplanar [1] . 
Since Halin graphs are not outerplanar (every outerplanar graph has at least on 
vertex of degree two) a lower bound on the track number of Halin graphs is 3. 

We now describe an algorithm to compute a 4-track layout of a Halin graph. 
An external path of an embedded rooted ordered tree T is the path 7r = 717 U n r , 
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where 7 q is the path from the leftmost leaf of T to the root of T and 7r r is the 
path from the root of T to the rightmost leaf of T. Let v be a vertex in an 
external path 7r. If v has children that are not in 7r then every subtree rooted at 
a child of v not in 7r is called a dangling subtree of n. 

Let G be a Halin graph. Assume that G is embedded in the plane such that it 
is planar and its adjoint cycle C is the exterior face. Let T be the characteristic 
tree of G. T inherits its embedding from G. Arbitrarily choose one of the non-leaf 
vertices of T as the root. A level decomposition of T is an assignment of a level 
to each vertex v of T that is defined as follows (see Figure 5): (i) all the vertices 
on the external path of T are given level 0; (ii) Let 7r be an external path of level 
i. For any dangling subtree T' of n, the vertices on the external path of T' are 
given level % + 1. 



lev 0 
lev 1 
lev 2 
lev 3 



^14 



^15 .... 



v 13 ; vii 

4 

v 12 



v\Q 



v is, - " ^ - yij ~2 ~ 

^10 : v 4 v 3 v 2 

i v 19_ 

vg / ''-V5 



v 3 v 20 v 7 v 6 



VQ 



v\ 



Fig. 5. A level decomposition of the characteristic tree of the Halin graph in Figure 4. 



Let 7T be an external path of any level. Let To, Ti, . . . , T^—i be the dangling 
subtrees of 7 r. We define a natural ordering of the dangling subtrees as follows: 
(i) Tq, Tj, . . . , T h _! are ordered from left to right according to their parents order 
in 7r; (ii) dangling subtrees that have the same parent are ordered from left to 
right. A consequence of the level decomposition is the following. 

Property 1. Let G be a Halin graph and let 7r be an external path of any level 
with at least one dangling subtree. Let To,Ti, . . . , T/j_ 1 be the natural order of 
the dangling subtrees of 7 r. Then in the adjoint cycle C of G: (i) the leftmost leaf 
of 7r is adjacent to the leftmost leaf of To; (ii) the rightmost leaf of 7r is adjacent 
to the rightmost leaf of Ti_i; (iii) the rightmost leaf of T ; is adjacent to the 
leftmost leaf of T j+ i (j = 0,. — 2). 

Since the embedding of G is not changed when T is rooted, and since the 
boundary of the external face of G is C, then there exist an edge of C that 
connects the leftmost leaf of T to the rightmost leaf of T. The edge connecting 
the leftmost leaf of T and the rightmost leaf of T is called the long edge. Let 7r 
be an external path without dangling subtrees; if 7 r consists of three vertices u , 
v and w in this order and u and w are leafs of T, then u and w are connected by 
an edge of C and this edge is called a short edge. The long edge and the short 
edges are generically called overlapping edges. 

We now describe an algorithm to compute a 3-track layout of a Halin graph 
without overlapping edges. Later we will describe how the overlapping edges can 
be added back to the track layout using a fourth track-set. A Halin graph after 
the deletion of the overlapping edges is called a reduced Halin graph. 
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RHTrackLayout(G) 

Input: An embedded reduced Halin graph G 
Output: A 3-track layout of G. 

Let T be the characteristic tree of G, rooted at any vertex r; 

Q <— new queue (); 

Q ,enqueue(r); 
ord 4— 0; 

while Q is not empty 
v 4— Q. dequeue (); 
if v = r then t -4— 0; 

else t 4— ( track(parent(v )) + 1) mod 3; 

Let T' be the subtree rooted at v; 

Let n = vo,vi, . . . ,Vh-i be the external path of T'\ 
for i = 0 to h — 1 
track(vi) 4— t; 
order(vi) 4— ord\ 
ord 4— ord + 1; 

Let wo,Wi, . . . ,Wk-i be the children of Vi not in 7r ordered 
from left to right; 
for j = 0 to k — 1 
Q.enqueue{wj)\ 

Algorithm 2: Algorithm RHTrackLayoutQ 



Lemma 5. Let G be a reduced Halin graph. Algorithm, 2 computes a 3-track 
layout of G. 

Proof. We prove that the track assignment computed by Algorithm 2 has no 
overlaps nor X-crossing. The edges having both vertices in a same track-set are 
either edges of an external path or edges of the adjoint cycle connecting leafs 
of the same level. The edges of the external paths do not overlap since the two 
vertices of each edge are consecutive in a track-set. Let n be an external path of 
level i and let To, Tj, . . . , Th-i be the natural ordering of the dangling subtrees 
of 7 r. By Property 1 the edges of the adjoint cycle connecting two leafs of the 
same level z+1 connect the rightmost leaf of Tj to the leftmost leaf of Tj +i . Since 
Algorithm 2 lays out To,Ti, . . . , T/,_i according to their natural order, then the 
rightmost leaf of Tj and the leftmost leaf of T) + i are consecutive in a track-set 
and therefore they do not overlap. 

Let eo = (uo,vq) and e\ = (ui,ui) be two edges having the two vertices in 
two different track-sets and assume that uq and u\ are in the same track-set 
with uq < u\. Edges eo and t\ are either edges connecting the roots of dangling 
trees to their parents or edges of the adjoint cycle connecting leafs at consecutive 
levels. If Uq and ui are in different external path 7r 0 and 7r i , then uq is adjacent 
to a vertex (a leaf or the root) of a dangling subtree of 7To and u\ is adjacent 
to a vertex (a leaf or the root) of a dangling subtree of 7Ti. The vertices of the 
dangling subtrees of 7r 0 precede the vertices of the dangling subtree of 7Ti in each 
track-set. Therefore uq < v\ and an X-crossing is not possible. Assume uq and 
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Mi be in the same external path tt. If Mo is the leftmost vertex of tt then vg is the 
leftmost leaf of the first dangling subtree T 0 of tt (Property 1). It follows that 
Mo < Mi and an X-crossing is not possible also in this case. If u\ is the rightmost 
vertex of tt then mi is the is the rightmost leaf of the last dangling subtree Th-i 
of tt (Property 1). It follows that vg < Mi and an X-crossing is not possible. If ug 
is not the leftmost vertex of tt and Mi is not the rightmost vertex of it, then they 
are adjacent to the roots of two dangling subtrees of tt. Since Mo < Mi and since 
the dangling subtree are laid out according to their natural order, then Vg < mi 
and an X-crossing is not possible. □ 

Notice that the track layout of the Reduced Halin Graph is such that the 
vertices of each overlapping edge are both in the same track-set. The track layout 
of the reduced Halin graph of the graph in Figure 4 is shown in Figure 6 . The long 
edge (vg, M14) and the short edges {v2,vg) and (v?,v$) would create an overlap if 
considered in the track layout. 




”12 Vg V lg vg 



Fig. 6. A track layout of the reduced Halin graph of the graph in Figure 4. 



Lemma 6. Let G be a Halin graph and let G' the corresponding reduced Halin 
graph. Let "f(G') be a 3-track layout of G' computed by Algorithm 2 and let 
eg = (ito,uo) and ei = (mi,mi) be any pair of overlapping edges. The two edges 
eg and e\ do not have any vertex in common, and if their vertices are in the 
same track-set then either ug < u\ < vo < v± or u\ < ug < v\ < Vg. 

By Lemma 6 it is easy to see that a track layout of a Halin graph can be 
computed from a 3-track layout of a reduced Halin graph by adding a new track- 
set and moving the left vertex of each overlapping edge in the new track-set. 

Lemma 7. Let G be a Halin graph, then tn(G) < 4. 

Proof. Let 7 (G) the 3-track layout of the reduced Halin graph G' of G computed 
by Algorithm 2. Denote as tg,t\, and £2 the three track-set used and consider a 
new track-set denoted as £ 3 . For each overlapping edge e = (m,m) assume that 
u < v and set track(u) = 3, i.e. for each overlapping edge one of the vertices 
is assigned to the new track-set. This change of track-set do not introduce an 
overlap since no edge has both vertices in track-set £4. Also, no X-crossing is 
introduced. Namely, consider the the pair of track-sets consisting of track £4 and 
of any of the other three track-sets (denote this track-set as tf). The edges whose 
vertices are in this pair are the overlapping edges whose vertices were originally 
in Let eg = (ug, vg) and ei = (u±, Mi) be two of these edges and assume that ug 
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and Mi are in track-set to with uq < mi. By Lemma 6 we have uq < vq < u± < vi 
in track-set tj in 7 (G'). Since order(uo ), order(v 0), order(u 1), and order{ 17) are 
not changed, we have Uq < Ui in track-set t 3 and Vq < 17 in track-set ij. So there 
is no X-crossing. □ 

Theorem 5. Every Halin graph has track number at most 4 cmd admits a 4~ 
track drawing with volume at most 2x2 x n that can be computed in 0(n) time. 
Also, for every Halin graph tn(G) > 3. 

Proof. The results about the bounds on track number follows from the fact that 
Halin graphs are not outerplanar and by Lemma 7. We prove the result about 
volume. Consider the four tracks (a;, 0,0), (x, 1,0), (a;, 0,1) and (a;, 1,1), and 
denote them as To,Ti,T 2 , and T3, respectively. Compute a 4-track layout on four 
track-sets as described above . Let no,ni,n% and 713 be the number of vertices 
on track-set to, ti, t% and to, respectively. Draw the vertices assigned to track-set 
ti on track T* according to the total order defined in the track-set, so that they 
occupy ^-coordinates from X^j=o n :i t° 0 n i (* = 1> 2, 3, 4). We prove that 

the drawing has no crossing. Overlap and X-crossings are not possible, because 
there is no overlap and no X-crossing in the track layout. A crossing is then 
possible only between edges on crossing strips. There are only two such strips 
CT02 and fj 13. The co-planarity equation of an edge on 002 and an edge on <713 is 
Xi + X3 = Xq + X2- Since Xq < X\ < X2 < £3 then the equation has no solution, 

i.e. a crossing is not possible. It follows that the drawing is a track drawing 
with volume 2x2 x n. The drawing algorithm consider a vertex per time and 
executes a constant number of operations for each vertex. The time complexity 
is therefore 0(n). □ 
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Abstract. This paper continues the study of 3D visibility representa- 
tions of complete graphs where vertices are represented by equal convex 
polygons lying in planes parallel to the xy-plane. Edges correspond to 
the s-parallel visibility among these polygons. 

We give several bounds on the size of the largest complete graph that 
has a 3D visibility representation with particular properties. Namely we 
improve the best known lower bound for representations by regular n- 
gons from + 2 to n+1 and the upper bound from 2 2 to ( 3 ^'Z'i) —3. 

1 Introduction 

In this paper we study visibility representations of graphs. Two-dimensional vari- 
ants of these representations were intesively studied in the past because of their 
connection to the VLSI routing and circuit board layout [8], [10]. Particularly the 
rectangle visibility graphs (axis-aligned rectangles in the plane with axis-aligned 
visibility segments among them) received wide attention (see for example [5] and 
[7]) in the last decade. 

It is natural to generalize these representations into three dimensions. Two- 
dimensional Euclidean sets act as vertices there. The sets are situated in the 
planes parallel to the xy-plane. Two vertices are connected if and only if they 
can see each other in the direction that is orthogonal to their planes. 

Fekete et al. study representations by squares and rectangles. In [6] and [9] it 
is shown that K 7 can be represented by equal squares but K% cannot. Similarly 
K 22 has a representation by rectangles of different shapes but K$q has not. Alt, 
Godeau and Whitesides [1] prove that there doesn’t exist a convex polygon P 
such that any complete graph has a visibility representation by shifted copies 
of P. Particularly K m for m > 2 2 cannot be represented by convex n-gons. 
They also describe a representation of any graph on n vertices by (possibly 
different) polygons with at most 2 n sides. Babilon et al. [2] are interested in 
the maximum size of a clique represented by regular n-gons and present a lower 
bound + 2. 

Cobos et al. [4] extend visibility representations even into the higher dimen- 
sions and characterize the dimensionality (e.g. the lowest dimension in which 
a representation of the given graph exists) of several classes of graphs. 

G. Liotta (Ed.): GD 2003, LNCS 2912, pp. 226-237, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 
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We continue in the study of the maximum size of a complete graph with a vis- 
ibility representation where vertices are represented by shifted copies of a given 
(open) polygon. It can be easily shown (see for example [1]) that a complete 
graph of an arbitrary size can be represented by any non-convex polygon. So, 
our attention is restricted to convex polygons only. Therefore the term polygon 
means convex polygon in the sequel. 

Let s(P) denote the maximum size of a complete graph with a visibility 
representation by copies of the polygon P. Denote by R n a regular n-gon. It was 
shown in [1] and [2] that + 2 < s(R n ) < 2 2 ". We improve both sides of 

this estimate. 

2 Coordinates 

Let’s remind the notion of polygon coordinates in a visibility representation 
(introduced in [2]). Firstly note that because of the openness of the polygons 
used in the representation we can expect that no two projections (into the plane 
z — 0) of sides of the polygons lie in the same line. Let P be a projection of some 
polygon into the plane z = 0. Denote by itj, i £ {1, . . . ,n} the unitary vector 
orthogonal to the i- th side of P. We choose the orientation of Ui such that Ui 
aims out of P. 

Note that the definition of Ui doesn’t depend on the selection of P because 
all polygons in the representation are shifted copies of the same polygon. 

Now consider projections of i-th sides of all polygons from the representation. 
The i-th coordinate of the polygon P is the integer number that denotes the 
order (in the direction of Ui) of the projection of the i-th side of P among the 
projections of i-th sides of the other polygons. 

The polygon coordinates allow us to recognize configurations that cannot 
appear in a representation of a complete graph. 

Lemma 1. [2] Let P,Q,R be n-gons whose i-th coordinates are monotone for 
every i £ {l,...,n}. If the z-coordinates of those polygons are also monotone 
then P cannot see R along the z-axis. 

3 Relationships 

It is important to understand that individual coordinates of polygons are not 
independent. 

Lemma 2. Let P and Q be polygons from a visibility representation. Let (p ±, . . . , 
p n ) and (gi, . . . , q n ) be their coordinates. There exist i,j £ {1, . . . , n }, i < j such 
that one of the following conditions is satisfied: 

— pk < qk for k £ {*,... ,j — 1} and pk > qk otherwise 

— Pk > qk for k £ {i, ... ,j — 1} and pk < qk otherwise 
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Proof. Let Q' be a projection of Q into the plane of P. Denote by u the trans- 
lation vector from P to Q' and by Ik the line that goes through the origin and is 
parallel to the fc-th side of P. Let Uk be the vector from the definition of polygon 
coordinates. The line Ik (that is orthogonal to Uk) divides the plane into two 
half planes. We call them positive and negative depending on the fact whether 
Uk points into them or not. It is clear that pk < qu if and only if u points into 
the positive half plane. 

Now consider all lines Ik together. The polygon P is convex. Therefore one 
of the following two possibilities must occur. 

— There exist i and j such that u lies in the positive half planes that correspond 
to lines Ik, i < k < j and in the negative half planes that correspond to the 
other lines. 

— There exist i and j such that u lies in the negative half planes that correspond 
to lines Ik, i < k < j and in the positive half planes that correspond to the 
other lines. 

And that is a reformulation of what we want to prove. □ 



Definition 1. Let P and Q be polygons in a visibility representation. Denote by 
i and j the numbers from the previous lemma. We say that P has a relationship 
(i,j) to Q if the first condition is satisfied. Otherwise we say that P has to Q 
a relationship ( j,i ). Finally the relationship between P and Q is the relationship 
of the polygon with the smaller z-coordinate to the other one. 

Consider relationships in a representation by the given polygon P. Each 
relationship corresponds to one part of the plane divided by cuts (going through 
the origin) that are parallel to the individual sides of P. For example we have 2 n 
relationships for a general n-gon or for a regular 2n-gon. We denote this number 
by r(P). 

Now we can reformulate Lemma 1. in terms of relationships. 

Lemma 3. Let P,Q,R be polygons from a visibility representation. If P has 
to Q the same relationship as Q has to R then if Q lies ( according to the z- 
coordinate) between P and R then P cannot see R. 

4 Upper Bounds 

In this section we present several upper bounds on the size of a complete graph 
with a visibility representation with particular properties. Firstly we improve 
the upper bound for s(R n ) by using the notion of relationships. 

Theorem 1. If P is a polygon then s(P) < [_e.r(P)!J. 

Proof. We proceed by induction on r(P). Let’s suppose that there are k + 1 
relationships in the given visibility representation. Divide polygons into sets 
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Mi, . . . , Mfc + i according to their relationship to the polygon with the maximum 
^-coordinate. 

If r is a relationship that corresponds to the set M* then there are no two poly- 
gons in Mi with this relationship. Otherwise these two polygons together with 
the one with the maximum ^-coordinate would fulfill presumptions of Lemma 3. 
Hence, there are at most k types of relationships among polygons in Mi. There- 
fore \Mi\ < [e.fclj and there is at most 1 + (k + 1) \e.k\\ = [e.(k + 1) !J polygons 
in the whole representation. 

It remains to prove the simplest case k = 1. If there is only one type of 
relationship in the representation then there can be at most 2 = [_e.l!J polygons. 
Otherwise we would be able to apply Lemma 3. to any triple of polygons. □ 

Later in the paper we derive an estimate with a better asymptotic behavior 
than the previous one but the previous estimate of s(R n ) gives the best known 
upper bounds for n equal to 5, 6, 8 and 10. 

For the sake of simplicity we work in the rest of this paper with objects in 
different parallel planes as if they are in the same plane. From the formal point 
of view such a description should be understood as an operation on projections 
into a common plane and projections of a result back to the individual planes. 

In the rest of this section we denote by Xi the first vertex of the *-th polygon 
from the representation. We also say that polygons Pi, . . . , Pk are in 2 -order if 
they are ordered according to their increasing 2 -coordinate. 

Lemma 4. Let polygons Pi , . . . , Pk (in z-order) form a visibility representation 
of Kk- If X i . . . XkXi form a boundary of a convex polygon then k < r(Pi) + 2. 

Proof. Denote by Ui the vector Xi + i — Xi. X 1 X 2 . . . X^Xi is a boundary of 
a convex polygon. Therefore is a sequence of vectors that rotate around 

the origin by less than 27r. 

The vector n, is clearly a translation vector from P,; to P t +i- This vector de- 
termines the relationship between P t and P i+ 1 . Because of the rotation around 
the origin and Lemma 3. there cannot be two equal relationships between poly- 
gons Pi and Pi + 1 , with one exception - the relationship between Pi and P 2 could 
be the same as the relationship between Pk-i and Pk- Hence k < r(Pi) + 2. □ 

Regular (2 n + l)-gons can have two types of relationships (*, j): (j — i) mod 
(2n+l) could be either n or n+ 1. For (4n+l)-gons the first situation corresponds 
to the pull of the upper polygon by a vertex and the second to the pull by an 
edge. The opposite holds for (4n+3)-gons. That is why we call such relationships 
edge-relationships and vertex-relationships. 

Definition 2. Let V be a vertex of a regular (2 n + 1 )-gon P with the center Sp. 
Denote by P' the copy of P shifted by the vector SpV . We call the relationship 
of P to P' V -relationship (or the relationship V). 

Let E be an edge of P. If we replace V in the previous paragraph by the 
center of E we get the definition of E -relationship (the relationship E). The 
following two lemmas give us the basic properties of these relationships. 
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Lemma 5. Let P,Q,R (in z-order) be regular (4 n + 1 )-gons from a visibility 
representation. Denote by V. (Ei) the i-th vertex (edge) of P. If P and Q have 
Ei-relationship and Q and R have Vi+i-relationship then P and R cannot see 
each other. 

Proof. Let’s rotate the whole situation such that the vector ViV i+ 1 is parallel 
to the vector (1,0). Denote by pi ( p r ) the line that contains i ( E i+n+ i ) 

- the edge going from the vertex with the smallest (biggest) ^-coordinate that 
goes away from Ei. It is clear that P and R cannot see each other if the trans- 
lation vector between Q and R is lying between p r and pi , but the translation 
vector that corresponds to V) +1 -relationship has to lie between V)_|_„ + 2 V)+„+i 
and Vi— n Vi— n ^-i . D 




Lemma 6. Let P,Q,R (in z-order) be regular (4 n + 3)-gons from a visibility 
representation. Denote by Vi (Ei) the i-th vertex (edge) of P. If P and Q have 
Vi-relationship and Q and R have Ei-relationship then P and R cannot see each 
other. 



Proof. We proceed in the same way as in the previous proof. 



□ 
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Lemma 4. gives an upper bound 2n + 2 for regular 2n-gons. Using Lemmas 
5. and 6. we can prove the same bound also for regular (2n + l)-gons. 

Lemma 7. Let regular (2 n + 1 )-gons Pi, . . . , P& (in z-order) form a visibility 
representation of Kk- If the centers of the polygons form a boundary of a convex 
polygon then k < 2n + 2. 

Proof. We prove the lemma for (4 n + l)-gons. The proof for (4n + 3)-gons is 
similar. 

Using the same argument as in the proof of Lemma 4. we can show that 
there cannot be the same relationships between polygons P, and Pj + i. Again 
with one exception - the relationship between Pi and P 2 could be the same as 
the relationship between Pk-i and Pk- 

Without loss of generality we can assume that the vertices and edges are 
numbered such that the relationship between Pi and P 2 is Vi or E\ and that the 
sequence of relationships between P, and P i+ i is a subsequence of V 1 E 1 V 2 . ■ . 
E m V 1 or E 1 V 2 E 2 ■ ■ ■ E m ViEi, where m = 4n + 1. Due to Lemma 5. we know 
that this sequence doesn’t contain any pair P,;U;+i ■ 

Hence, it is sufficient to show that a subsequence avoiding mentioned pair 
that is chosen 

— from E 1 V 2 ■ ■ ■ E m Vi has length at most m, 

— from V 1 E 1 V 2 ■ ■ ■ E m V 1 has length at most m + 1. 

We proceed by induction, for m = 1 the claim holds because we have to leave 
at least one relationship from the sequences P 1 V 2 and ViEiV 2 . 

Let’s consider what can remain from the first two relationships in the first 
case. If we omit E\ then by the second case the length of the subsequence is at 
most to. If we omit V 2 we get the first case for a shorter sequence and again the 
length at most to. 

In the second case we have to consider the first three relationships. If we omit 
Vi then by the first case the length of the subsequence is at most m. If we omit 
Ei then we get the second case for a shorter sequence and the length at most 
to + 1. Finally if we omit V 2 we get again the length at most to + 1 by the first 
case. □ 

We demanded that corresponding points of polygons lie in a convex position 
and that polygons are ordered according to their ^-coordinates. Let’s see how 
the situation changes when we remove the second requirement. 

Lemma 8. Let polygons Pi,...,Pk form a visibility representation of Kk- If 
X\ . . . XkX 1 form a boundary of a convex polygon then k < 3r(Pi). 

Proof. Let iq has the same meaning as in the proof of Lemma 4. Denote by r,; the 
relationship that corresponds to the translation vector Ui. (uf)^ ' =1 is a sequence of 
vectors that rotate around the origin by less than 27 t. Therefore the relationships 



